THE $L^2$ RESTRICTION OF THE EISENSTEIN SERIES TO A GEODESIC SEGMENT

MATTHEW P. YOUNG 


Abstract. We study the $L^2$ norm of the Eisenstein series $E(z, 1/2 + iT)$ restricted to a
segment of a geodesic connecting infinity and an arbitrary real. We conjecture that on slightly
thickened geodesics of this form, the Eisenstein series satisfies restricted QUE. We prove a
lower bound that matches this predicted asymptotic. We also prove an upper bound that
nearly matches the lower bound assuming the Riemann Hypothesis (unconditionally, the
sharp upper bound holds for almost all $T$). Finally, we show the restricted QUE conjecture
for geodesics with rational endpoints.


1. Introduction 

1.1. Background and statements of results. The behavior of eigenfunctions of the
Laplacian on an arithmetic surface $\Gamma\backslash\mathbb{H}$ is a rich subject with connections to analytic number
theory and spectral geometry. One of the foundational questions in the area is the Quantum
Unique Ergodicity (QUE) conjecture of Rudnick and Sarnak [RS], which is now a theorem
of Lindenstrauss [L] (complete in the compact setting), with input by Soundararajan [So] to
cover congruence subgroups.

It is desirable to understand the (non)-localization of Laplace eigenfunctions on a negatively
curved finite volume manifold, measured via $L^p$ restriction norms. For instance, one
may fix a curve on $\Gamma\backslash\mathbb{H}$, and study the analytic behavior of the eigenfunction restricted
to the curve. Such questions were initially raised by Reznikov [R], who used representation
theory techniques for their study. Later, Ghosh, Reznikov, and Sarnak [GRS] obtained estimates
for the $L^2$ norm of a Maass form along special curves such as horocycles or geodesic
segments, with the application of counting sign changes of the Maass form along the curve
(see also work of J. Jung [J] for further progress in this direction). Marshall [M] has used
the amplification method to estimate the $L^2$ norm of a Maass form restricted to an arbitrary
geodesic segment, with a power saving over the local bound. Toth and Zelditch [TZ]
and Dyatlov and Zworski [DZ] have shown a quantum ergodic restriction theorem that in
particular holds on a generic geodesic in a compact hyperbolic surface.

The author [Y] recently showed that the Eisenstein series $E_T(z) = E(z, 1/2 + iT)$ for
$\Gamma = SL_2(\mathbb{Z})$ equidistributes along the geodesic connecting $0$ and $i\infty$ (this is an analog of
QUE but along the curve). The Eisenstein series is a good test case for more advanced
questions since it behaves in many respects like the cusp forms, yet is more explicit and
easier to handle. In this paper, we study the behavior of the Eisenstein series restricted to a
geodesic segment of the form $\{x + iy : \beta > y > \alpha > 0\}$. This is not a simple modification of
our previous work, and the tradeoff in generality is that the results here are not as complete
as in [Y].
There are two overriding differences when one generalizes $x$ to be arbitrary instead of
$x = 0$. To aid in this discussion, we quickly gather some notation from [Y]. Let
$E_T^*(z) = \frac{\theta(1/2+iT)}{|\theta(1/2+iT)|} E_T(z)$, where $\theta(s) = \pi^{-s}\Gamma(s)\zeta(2s)$. This makes $E_T^*$ real-valued for $z \in \mathbb{H}$. The
Fourier expansion of $E_T^*$ is

(1.1) $E_T^*(x + iy) = \mu y^{1/2+iT} + \overline{\mu} y^{1/2-iT} + \rho^*(1) \sum_{n \neq 0} \frac{\tau_{iT}(n) e(nx)}{|n|^{1/2}} V_T(2\pi |n| y).$

Here $\rho^*(1) = (2/\pi)^{1/2} |\theta(1/2 + iT)|^{-1}$, $\mu = \theta(1/2+iT)/|\theta(1/2+iT)|$, $V_T(y) = \sqrt{y} K_{iT}(y)$, and $\tau_{iT}(n) =
\sum_{ab=|n|} (a/b)^{iT}$. The most important difference is that when $x = 0$, the coefficients $e(nx)\tau_{iT}(n)$
of $E_T^*$ are multiplicative, while for general $x \in \mathbb{R}$ they are not. This causes some crucial aspects
of [Y] to break down, and we are only able to partially replace these with methods
that work for general $x$.
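To make the multiplicativity remark concrete, here is a minimal Python sketch that evaluates $\tau_{iT}(n) = \sum_{ab=|n|}(a/b)^{iT}$ directly from its definition and compares the twisted coefficients $e(nx)\tau_{iT}(n)$ at $x = 0$ and at an irrational $x$. The helper names (`tau`, `twisted`) and the sample values $T = 5$, $x = \sqrt{2}$ are illustrative assumptions and do not come from the paper.

```python
import cmath
import math


def e(x: float) -> complex:
    """Additive character e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * math.pi * x)


def tau(n: int, T: float) -> complex:
    """tau_{iT}(n): sum of (a/b)^{iT} over factorizations ab = |n|."""
    n = abs(n)
    total = 0j
    for a in range(1, n + 1):
        if n % a == 0:
            b = n // a
            total += cmath.exp(1j * T * math.log(a / b))
    return total


def twisted(n: int, T: float, x: float) -> complex:
    """Fourier coefficient e(nx) * tau_{iT}(n), up to normalization."""
    return e(n * x) * tau(n, T)


T = 5.0  # illustrative spectral parameter

# At x = 0 the coefficients are multiplicative on coprime arguments.
gap_at_zero = abs(twisted(6, T, 0.0) - twisted(2, T, 0.0) * twisted(3, T, 0.0))

# At a generic x the additive twist e(nx) destroys multiplicativity:
# e(6x) tau(6) versus e(2x) e(3x) tau(2) tau(3) = e(5x) tau(6).
x_generic = math.sqrt(2)
gap_generic = abs(
    twisted(6, T, x_generic) - twisted(2, T, x_generic) * twisted(3, T, x_generic)
)
```

The same loop also shows that $\tau_{iT}(n)$ is real, since swapping $a$ and $b$ conjugates each summand.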

The other significant difference comes from an asymmetry between $n > 0$ and $n < 0$
in (1.1). Note that the $K$-Bessel function in the Fourier expansion only depends on $|n|$,
which has the following practical effect. When one considers a $y$-integral of $|E_T^*(x + iy)|^2$,
analyzed by squaring out and integrating the Fourier expansion term-by-term, there will
be diagonal terms and off-diagonal terms. The diagonal parts will include a sum with, say,
$n_1 = n_2$ as well as terms with $n_1 = -n_2$, the latter of which will then lead to sums of the
form $\sum_n |\tau_{iT}(n)|^2 e(2nx)$. It is also necessary to treat the opposite sign off-diagonal terms of
the form $\sum_n \tau_{iT}(n)\tau_{iT}(n + h)e((2n + h)x)$. It is quite difficult to estimate such sums
because the weight is highly oscillatory (for general $x$, that is; the case $x = 0$, for example,
does not have this problematic feature). We are able to sidestep this particular problem
by “thickening” the geodesic segment with an extremely short $x$-integral. This $x$-integral
is required to be barely longer than the Planck scale of length $1/T$; on smaller scales, the
Eisenstein series is roughly constant. This thickening procedure gives just enough extra
savings to bound the terms with opposite signs, but has no practical effect on the terms with
the same sign.
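The effect of the thickening can be sketched numerically. The integral of $e(kx)$ over $|x - x_0| \le w$ has the closed form $e(kx_0)\sin(2\pi k w)/(\pi k)$, so it is at most $1/(\pi|k|)$ for large frequencies $k$ (opposite signs, $k = n - m \approx 2N$) and essentially the trivial value $2w$ for small $k$ (same signs). The function names and the sample values of $T$, $\gamma$, $N$ below are assumptions chosen only for illustration.

```python
import cmath
import math


def e(x: float) -> complex:
    """Additive character e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * math.pi * x)


def short_integral(k: int, x0: float, w: float) -> complex:
    """Closed form of the integral of e(k x) over |x - x0| <= w."""
    if k == 0:
        return complex(2 * w)
    return e(k * x0) * math.sin(2 * math.pi * k * w) / (math.pi * k)


def midpoint_rule(k: int, x0: float, w: float, steps: int = 20000) -> complex:
    """Numerical cross-check of the closed form via the midpoint rule."""
    h = 2 * w / steps
    return h * sum(e(k * (x0 - w + (j + 0.5) * h)) for j in range(steps))


T, gamma = 1000.0, 20.0   # illustrative values, gamma/T is the half-width
w = gamma / T
N = 900                   # illustrative size of |m|, |n|

opposite_sign = short_integral(2 * N, 0.3, w)  # m = -N, n = N, so k = n - m = 2N
same_sign = short_integral(3, 0.3, w)          # nearby m, n of the same sign
trivial_bound = 2 * w
```

With these sample values the opposite-sign integral is smaller than the trivial bound by roughly a factor $N\gamma/T$, while the same-sign integral is nearly as large as the trivial bound.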

Now we state our main results. Suppose that $\psi$ is a smooth function with support on
$[\alpha, \beta]$. Define

(1.2) $I_\psi(x, T) = \int_0^\infty \psi^2(y) |E_T(x + iy)|^2 \frac{dy}{y}.$

To get our bearings, we quote two results from the literature. Suppose that $\phi : SL_2(\mathbb{Z})\backslash\mathbb{H} \to
\mathbb{R}$ is smooth and compactly-supported. Luo and Sarnak [LS] showed

(1.3) $\int_{SL_2(\mathbb{Z})\backslash\mathbb{H}} \phi(z) |E_T(z)|^2 \frac{dx\,dy}{y^2} = \frac{3}{\pi} \log(1/4 + T^2) \int_{SL_2(\mathbb{Z})\backslash\mathbb{H}} \phi(z) \frac{dx\,dy}{y^2} + o(\log T),$

as $T \to \infty$. The interested reader can find a full main term with a power saving error term
in [Y, (5.35)]. The author [Y] showed

(1.4) $I_\psi(0, T) = 2 \cdot \frac{3}{\pi} \log(1/4 + T^2) \int_0^\infty \psi^2(y) \frac{dy}{y} + o(\log T).$

Again, one has a fully developed main term with a power saving error term. The Eisenstein
series (squared) is twice as large on the distinguished geodesic $x = 0$, accounting for the
opposite sign diagonal terms which are smaller for typical $x$.
For $\gamma > 0$, define

(1.5) $I_{\psi,\gamma}(x_0, T) = \int_0^\infty \int_{|x - x_0| \leq \frac{\gamma}{T}} \psi^2(y) |E_T(x + iy)|^2 \,dx \frac{dy}{y},$

which we wish to compare to the (weighted) measure of the set, that is,

(1.6) $m_{\psi,\gamma} = \int_0^\infty \int_{|x - x_0| \leq \frac{\gamma}{T}} \psi^2(y) \,dx \frac{dy}{y}.$

Here $I_{\psi,\gamma}$ represents the thickened geodesic integral. Note $m_{\psi,\gamma} \asymp \gamma/T$ for $\psi$ fixed.
Our first result is a lower bound of the correct order of magnitude.


Theorem 1.1. Suppose $\gamma = \gamma(T) \to \infty$ as $T \to \infty$, arbitrarily slowly. Then uniformly in
$x_0 \in \mathbb{R}$, we have as $T \to \infty$,

(1.7) $I_{\psi,\gamma}(x_0, T) \geq \frac{3}{\pi} \log(1/4 + T^2)\, m_{\psi,\gamma} (1 + o(1)).$

To partially complement Theorem 1.1 we have the following upper bound.


Theorem 1.2. Uniformly in $x \in \mathbb{R}$, we have

(1.8) $I_\psi(x, T) \ll (\log T)^2.$

Alternatively, we have

(1.9) $I_\psi(x, T) \ll \log T \left(1 + \frac{(\log\log T)^2}{|\zeta(1 + 2iT)|^2}\right).$

Assuming the Riemann Hypothesis, we have $|\zeta(1 + 2iT)|^{-1} \ll \log\log T$ (see [T, (14.8.3)]),
and so

(1.10) $I_\psi(x, T) \ll (\log T)(\log\log T)^4.$

It is also known that $|\zeta(1 + 2iT)|^{-1} \ll \log\log T$ for “almost all” $T$ (see [GS]), so that (1.9)
is almost always stronger than (1.8).

One may expect that the Eisenstein series equidistributes along any fixed geodesic. 


Conjecture 1.3. Under the same conditions as Theorem 1.1, we have

(1.11) $I_{\psi,\gamma}(x_0, T) = \frac{3}{\pi} \log(1/4 + T^2)\, m_{\psi,\gamma} (1 + o(1)).$

The averaging over $x$ in Conjecture 1.3 is important because some individual geodesics
have more mass than others (e.g. $x = 0$ as we mentioned earlier). This averaging is enough
to wash out the exceptional behavior on $x = 0$.

The extremely thin $x$-integral appearing in Conjecture 1.3 means that the methods used to
prove QUE for Eisenstein series are not suitable for this set. Harmonic analysis on $SL_2(\mathbb{Z})\backslash\mathbb{H}$
is not appropriate on sets of this shape.

We are able to report some partial progress on Conjecture 1.3. To do so, we need to
describe the earliest stages of analysis of $I_\psi(x, T)$. By a use of Parseval’s formula (in the
Mellin transform setting), $I_\psi(x, T)$ decomposes as

(1.12) $I_\psi(x, T) = \frac{1}{2\pi} \int_{-\infty}^{\infty} |F_{x,T}(it)|^2 \,dt,$

where

(1.13) $F_{x,T}(s) = \int_0^\infty \psi(y) y^s E_T^*(x + iy) \frac{dy}{y}.$

We will then insert the Fourier expansion (1.1) and calculate the $y$-integral in terms of
gamma functions (see (2.1) below). One arrives at an integral of the form


(1.14) $|\zeta(1 + 2iT)|^{-2} \int_{|t| \leq T} (1 + |t - T|)^{-1/2}\, T^{-1/2} \left| \sum_{n \neq 0} \frac{\tau_{iT}(n) e(nx)}{|n|^{1/2+it}} k(n, t, T) \right|^2 dt,$

as well as some other terms of lesser importance. Here $k$ is an innocuous weight function,
which restricts the support of $n$ to $|n| \asymp T^{1/2}(1 + |t - T|)^{1/2}$. The integral naturally decomposes
into pieces of the form $|t - T| \asymp \Delta$ where $1 \ll \Delta = o(T)$, as well as the “bulk” range
with $|t| \leq T(1 - o(1))$. The diagonal term in the bulk range is responsible for the main term.
This part of the $t$-integral is also where the difficulties lie with the opposite sign off-diagonal
terms. The device of incorporating the extra $x$-integral allows us to bound the opposite sign
terms trivially. For $\Delta = o(T)$, we do not expect to obtain a main term, and so we can use
Cauchy’s inequality to separate the positive and negative values of $n$, thereby avoiding this
tricky problem in this range of $\Delta$’s.

Define 


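(1.15) $I(\Delta, T, x, N) = \int_{-\infty}^{\infty} w_1\left(\frac{t - T}{\Delta}\right) \left| \sum_{n \geq 1} \frac{\tau_{iT}(n) e(nx)}{n^{1/2+it}} w_2\left(\frac{n}{N}\right) \right|^2 dt,$

where $w_1$ and $w_2$ are some fixed smooth compactly-supported weight functions, and $N =
\sqrt{\Delta T}$. Roughly speaking, to turn Theorem 1.1 into an asymptotic formula, we need to show
that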

(1.16) $\sum_{\substack{1 \ll \Delta = o(T) \\ \Delta \text{ dyadic}}} (\Delta T)^{-1/2} I(\Delta, T, x, N) = o(|\zeta(1 + 2iT)|^2 \log T).$

This is the same as showing

(1.17) $I(\Delta, T, x, N) = o(|\zeta(1 + 2iT)|^2 N),$

on average over $\Delta$ ranging over the $O(\log T)$ dyadic segments. The mean value theorem for
Dirichlet polynomials shows that $I(\Delta, T, x, N) \ll |\zeta(1 + 2iT)|^2 (\Delta + N) \log T$, which gives
rise to (1.8). This shows that to prove Conjecture 1.3 we need to save slightly more than
$\log T$ in our analysis of $I(\Delta, T, x, N)$.
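The bookkeeping behind this trivial bound can be sketched in a few lines of Python: after dividing by $|\zeta(1+2iT)|^2$, each dyadic $\Delta$ contributes $(\Delta T)^{-1/2}(\Delta + N)\log T$ with $N = (\Delta T)^{1/2}$. The diagonal part $\Delta$ sums geometrically to $O(\log T)$, while the off-diagonal part $N$ contributes $\log T$ for each of the $O(\log T)$ dyadic values. The function name and the choice $T = 2^{40}$ are assumptions made for illustration.

```python
import math


def dyadic_trivial_bound(T: float):
    """Sum (Delta*T)^(-1/2) * (Delta + N) * log T over dyadic 1 <= Delta <= T.

    Returns (diagonal part, off-diagonal part, number of dyadic values).
    """
    diagonal = 0.0
    off_diagonal = 0.0
    count = 0
    delta = 1.0
    while delta <= T:
        N = math.sqrt(delta * T)
        diagonal += (delta / N) * math.log(T)   # (Delta/T)^(1/2) * log T
        off_diagonal += (N / N) * math.log(T)   # exactly log T per dyadic Delta
        count += 1
        delta *= 2.0
    return diagonal, off_diagonal, count


T = 2.0 ** 40  # illustrative size of the spectral parameter
diagonal, off_diagonal, count = dyadic_trivial_bound(T)
```

The diagonal total stays within a constant multiple of $\log T$, whereas the off-diagonal total is `count * log T`, which is the source of the $(\log T)^2$ in (1.8).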

Estimating (1.15) is the main subject of interest in this paper. The qualitative behavior
of the integral changes drastically as $\Delta$ ranges from $1$ to $T$. We are able to estimate it
satisfactorily for $\Delta$ sufficiently large, or sufficiently small, with uniformity in $x$.


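Theorem 1.4. Suppose $\gamma = \gamma(T) \to \infty$. Then

(1.18) $\int_{|x - x_0| \leq \frac{\gamma}{T}} \frac{1}{2\pi} \int_{|t| \leq T - C(\varepsilon) T^{25/27+\varepsilon}} |F_{x,T}(it)|^2 \,dt\,dx = \frac{3}{\pi} \log(1/4 + T^2)\, m_{\psi,\gamma} \left(1 + O(\gamma^{-1/5} + \cdots)\right).$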
Theorem 1.4 implies Theorem 1.1 by positivity. Actually, to prove Theorem 1.1 we only
need to evaluate the integral over $|t| \leq T(1 - o(1))$, which is somewhat easier.

For the smaller values of $|t \mp T|$, we have

Theorem 1.5. Let $0 < \varepsilon < 1/4$. Then

(1.19) $\int_{T^{\varepsilon} \leq |t \mp T| \leq T^{1/2-\varepsilon}} |F_{x,T}(it)|^2 \,dt \ll 1.$

The very smallest values of $|t \mp T|$ require a slightly different proof, and we claim this
estimate with

Proposition 1.6. We have

(1.20) $\int_{|t - T| \leq T^{1/100}} |F_{x,T}(it)|^2 \,dt \ll (\log T)^{\varepsilon}.$

In light of these results, we see that in order to prove Conjecture 1.3 we need to show
$\int_{T^{1/2-\varepsilon} \leq |t - T| \leq C T^{25/27+\varepsilon}} |F_{x,T}(it)|^2 \,dt = o(\log T)$. It would therefore be of great interest to close
the gap to treat these medium-range values of $\Delta$.

In [Y], Conjecture 1.3 was proved in the special case $x_0 = 0$ (without requiring the $x$-integral).
The key difference for this case is that $e(nx_0) = 1$ is multiplicative. More generally,
if $x_0$ is rational, then $e(nx_0)$ can be expressed as a short linear combination of multiplicative
functions (changing basis to Dirichlet characters), which leads to the following

Proposition 1.7. Suppose that $x_0$ is rational. Then if $\gamma \to \infty$, but constrained by $\gamma$
for some small $\delta > 0$, then

(1.21) $I_{\psi,\gamma}(x_0, T) = \frac{3}{\pi} \log(1/4 + T^2)\, m_{\psi,\gamma} (1 + o(1)).$
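The change of basis to Dirichlet characters mentioned before Proposition 1.7 can be illustrated with a small sketch. For a prime modulus $q$ and $m$ coprime to $q$, orthogonality of characters gives $e(m/q) = \frac{1}{q-1}\sum_{\chi} \overline{\chi}(m)\,\tau(\chi)$, where $\tau(\chi) = \sum_{r \bmod q} \chi(r) e(r/q)$ is the Gauss sum. The code below is only an illustration under the simplifying assumptions that $q$ is prime and that a primitive root $g$ is supplied by hand; the function names are hypothetical, and values of $m$ sharing a factor with $q$ would need separate treatment.

```python
import cmath
import math


def e(x: float) -> complex:
    """Additive character e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * math.pi * x)


def characters_mod_prime(q: int, g: int):
    """All Dirichlet characters mod a prime q, built from a primitive root g."""
    index = {}
    value = 1
    for k in range(q - 1):          # discrete logarithm table: g^k -> k
        index[value] = k
        value = value * g % q

    def make(j: int):
        def chi(n: int) -> complex:
            n %= q
            return 0j if n == 0 else e(j * index[n] / (q - 1))
        return chi

    return [make(j) for j in range(q - 1)]


def gauss_sum(chi, q: int) -> complex:
    """tau(chi) = sum over r mod q of chi(r) e(r/q)."""
    return sum(chi(r) * e(r / q) for r in range(1, q))


def additive_via_characters(m: int, q: int, chars) -> complex:
    """Rewrite e(m/q), for m coprime to q, as a combination of characters."""
    return sum(chi(m).conjugate() * gauss_sum(chi, q) for chi in chars) / (q - 1)


q, g = 7, 3                          # 3 is a primitive root modulo 7
chars = characters_mod_prime(q, g)
```

Since each $\chi$ is completely multiplicative, $e(nx_0)\tau_{iT}(n)$ with $x_0 = a/q$ becomes a combination of $q - 1$ multiplicative sequences on the integers coprime to $q$.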


1.2. Outline and overview. In this section, we give a simplified and informal presentation 
of the method of proof of the primary results of this paper. The comments here should not 
be taken literally, but instead should serve as a guide for reading the rest of the paper. 

As mentioned earlier in the introduction, we have

(1.22) $I_\psi(x, T) \approx |\zeta(1 + 2iT)|^{-2} \int_0^T (1 + |T - t|)^{-1/2}\, T^{-1/2} \left| \sum_{n \neq 0} \frac{\tau_{iT}(n) e(nx)}{|n|^{1/2+it}} \right|^2 dt.$


In reality, the sum over $n$ contains a weight function depending on $t$, $T$, $n$, and we have
only displayed the part of the integral with $t > 0$, the negative values of $t$ being similar by
symmetry. Cutting the $t$-integral into regions $|t - T| \asymp \Delta$ with $1 \ll \Delta \ll T$, we have

(1.23) $I_\psi(x, T) \approx |\zeta(1 + 2iT)|^{-2} \sum_{\substack{1 \ll \Delta \ll T \\ \text{dyadic}}} (\Delta T)^{-1/2} \int_{|t - T| \asymp \Delta} \left| \sum_{|n| \asymp N} \frac{\tau_{iT}(n) e(nx)}{|n|^{1/2+it}} \right|^2 dt,$

where

(1.24) $N = (\Delta T)^{1/2}.$


The mean value theorem for Dirichlet polynomials, along with a bound on the average size
of $|\tau_{iT}(n)|^2$ (see Lemma 2.4 below) essentially shows that

(1.25) $I_\psi(x, T) \ll \sum_{\substack{1 \ll \Delta \ll T \\ \text{dyadic}}} (\Delta T)^{-1/2} (\Delta + (\Delta T)^{1/2}) \log T \ll \log^2 T.$
This (roughly) explains how to derive (1.8), and in light of Conjecture 1.3 indicates that it is
desirable to save slightly more than a factor $\log T$ over this trivial bound. More precisely, the
diagonal terms correspond to the term $\Delta$ inside the parentheses, and the term $(\Delta T)^{1/2} \asymp N$
corresponds to the off-diagonal terms. One observes that the diagonal terms do indeed give
a final bound of $\log T$ to $I_\psi(x, T)$, and a more careful analysis (which is easy to perform for
the diagonal terms) leads to the asymptotic predicted in Conjecture 1.3.
Opening the square and integrating, we have


( 1 . 26 ) 




A 


(AT ) 1 / 2 


E 


Ti T {n)Ti T {m)e((n - m)x) 


|m|,|n|xAT 


1 77777 1 U 2 ! 777/77 1 1 ' 


iT 


It is difficult to deal with the terms with $m$ and $n$ of opposite signs, because one encounters
a shifted convolution sum with a highly oscillatory weight function. Even the opposite
diagonal case with $m = -n$ is problematic. One may partially circumvent this problem by
introducing an additional $x$-integral that is very short (of length $\gamma/T$ where $\gamma = \gamma(T) \to \infty$
arbitrarily slowly with $T$). The point is that the $x$-integral gives a small savings from the
oscillation of these opposite sign terms. Indeed, we have

(1.27) $\int_{|x - x_0| \leq \frac{\gamma}{T}} e((n - m)x) \,dx \ll \frac{1}{N},$

provided $|m|, |n| \asymp N$ and $m$, $n$ have opposite signs. Meanwhile, the trivial bound on the integral
is $\gamma/T$, so this technique gives some savings for $N$ somewhat larger than $T/\gamma$ (so this breaks
down for values of $\Delta$ somewhat smaller than $T$). For this reason, we rearrange (1.23) in the
following way. For $\Delta \gg \eta T$, where $\eta = \eta(T) \to 0$ slowly with $T$ ($\eta$ will depend on $\gamma$), we
use this method to bound the opposite sign terms in a satisfactory way. In this same range
of $\Delta$, we can solve the shifted divisor sum

(1.28) $\sum_{n \leq N} \tau_{iT}(n) \tau_{iT}(n + h),$

with a power-saving error term (indeed, the result of [Y] allows for $\Delta \gg T^{25/27+\varepsilon}$). In all, this
gives an asymptotic for the part of $I_\psi(x, T)$ with $\Delta \gg \eta T$, with the additional $x$-integral.
By the positivity of the right hand side of (1.23), for proving a lower bound, we can drop all
the other values of $\Delta$. This is the reasoning that leads to Theorem 1.1.

In light of Conjecture 1.3, it should then suffice to give an upper bound only for the right
hand side of (1.23) with $\Delta \ll \eta T$ (since the unconditional lower bound of Theorem 1.1 comes
from $\Delta \gg \eta T$, and matches the asymptotic of Conjecture 1.3). This pleasant fact allows one
to simplify the analysis by using the inequality $|\sum_{n \neq 0} a_n|^2 \leq 2|\sum_{n > 0} a_n|^2 + 2|\sum_{n < 0} a_n|^2$.
Using this allows us to essentially replace the right hand side of (1.26) with $m, n > 0$, when
$\Delta \ll \eta T$. Thus, for all values of $\Delta \ll \eta T$, one can completely avoid the opposite sign
problem (indeed, (1.27) is trivial for smaller values of $\Delta$, so one cannot rely on this idea in
this range).

The solution of the shifted divisor sum allows for all $\Delta \gg T^{25/27+\varepsilon}$ to be acceptable.
This explains the range of applicability of Theorem 1.4. In estimating this shifted divisor
sum, there is also an unexpectedly delicate matter with an off-diagonal main term, given by
(4.23) below. A trivial bound shows only that this main term contributes to $I(\Delta, T, x, N)$ an
amount that is $O((\Delta T)^{1/2} |\zeta(1 + 2iT)|^2)$, which just barely fails to satisfy the desired bound
(1.17). However, a finer analysis of the sum over the shift $h$ gains a slight saving which
allows us to save a factor of $\log T$ on average over the dyadic sum over $\Delta$.

When $\Delta$ is small, the shifted divisor sum approach breaks down. Another approach to the
problem is to open up the divisor function $\tau_{iT}(n) = \sum_{ab=n} (a/b)^{iT}$, and use the sums over $a$
and $b$ in some way. Arguments in this vein appear in Section 5. For example, we have
for a given value of $N$ that

(1.29) $\left| \sum_{n \asymp N} \frac{\tau_{iT}(n) e(nx)}{n^{1/2+it}} \right|^2 \ll \log T \sum_{B \text{ dyadic}} \left| \sum_{a \asymp A} \sum_{b \asymp B} \frac{e(abx)}{a^{1/2+it-iT}\, b^{1/2+it+iT}} \right|^2,$

where $AB \asymp N$ (observe that once $b$ is localized by $b \asymp B$, then automatically $a \asymp N/B$).
This comes from breaking up the sum over $a$ and $b$ into dyadic segments, and applying
Cauchy’s inequality. It turns out that if $A \gg \Delta^{1/2+\varepsilon}$, then one can get a savings from
Poisson summation on $a$ inside the square (and prior to any $t$-integration), which can then
be combined with savings from the $t$-integral. Similarly, if $B \gg T^{1/2+\varepsilon}$, then there is savings
from the $b$-sum. Since $AB = N = (\Delta T)^{1/2}$, this means $A = \Delta^{1/2+o(1)}$ and $B = T^{1/2+o(1)}$
is the main region of interest. Actually, a careful analysis shows that one may focus on
$A = \Delta^{1/2} (\log T)^{O(1)}$ and $B = T^{1/2} (\log T)^{O(1)}$.
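The exponents in the opened divisor sum can be checked numerically: since $\tau_{iT}(n) = \sum_{ab=n}(a/b)^{iT}$, each term $\tau_{iT}(n) e(nx) n^{-1/2-it}$ splits into terms $e(abx)\, a^{-(1/2+it-iT)}\, b^{-(1/2+it+iT)}$. The sketch below compares the two expressions over the simple range $ab \le M$ rather than over dyadic boxes; the function names and sample parameters are illustrative assumptions.

```python
import cmath
import math


def e(x: float) -> complex:
    """Additive character e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * math.pi * x)


def tau(n: int, T: float) -> complex:
    """tau_{iT}(n): sum of (a/b)^{iT} over factorizations ab = n, n >= 1."""
    total = 0j
    for a in range(1, n + 1):
        if n % a == 0:
            total += cmath.exp(1j * T * math.log(a / (n // a)))
    return total


def sum_over_n(M: int, t: float, T: float, x: float) -> complex:
    """Sum of tau_{iT}(n) e(nx) n^{-1/2-it} over 1 <= n <= M."""
    return sum(
        tau(n, T) * e(n * x) * cmath.exp(-(0.5 + 1j * t) * math.log(n))
        for n in range(1, M + 1)
    )


def sum_over_ab(M: int, t: float, T: float, x: float) -> complex:
    """Same sum with the divisor function opened: pairs (a, b) with ab <= M."""
    total = 0j
    for a in range(1, M + 1):
        a_factor = cmath.exp(-(0.5 + 1j * (t - T)) * math.log(a))
        for b in range(1, M // a + 1):
            b_factor = cmath.exp(-(0.5 + 1j * (t + T)) * math.log(b))
            total += e(a * b * x) * a_factor * b_factor
    return total
```

For instance, comparing `sum_over_n(60, 3.0, 7.0, 0.3)` with `sum_over_ab(60, 3.0, 7.0, 0.3)` gives agreement up to rounding error.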

On the other hand, if $\Delta$ is not too large, then one can use Cauchy’s inequality to move
the sum over $a$ to the outside, obtaining

(1.30) $\int_{|t - T| \asymp \Delta} \left| \sum_{a \asymp A} \sum_{b \asymp B} \frac{e(abx)}{a^{1/2+it-iT}\, b^{1/2+it+iT}} \right|^2 dt \ll \int_{|t - T| \asymp \Delta} \sum_{a \asymp A} \left| \sum_{b \asymp B} \frac{e(abx)}{b^{1/2+it+iT}} \right|^2 dt.$

The diagonal alone is of size $\Delta A$, which is then roughly $\Delta^{3/2}$ (for $A \approx \Delta^{1/2}$), which is better
than $N = (\Delta T)^{1/2}$ if $\Delta \ll T^{1/2-\varepsilon}$. The off-diagonal terms may be satisfactorily bounded
because the $t$-integral detects close values, say $b + h$, $b$ with now $h \neq 0$, and the summand
is still an oscillatory function of $b$, so exponential sum estimates will cover the off-diagonal
terms. This explains the range of applicability in Theorem 1.5.

The arguments of Section 5 are also used in the alternative upper bound of (1.10). Once
we know that the important ranges are $A = \Delta^{1/2} (\log T)^{O(1)}$ and $B = T^{1/2} (\log T)^{O(1)}$, we
can use Cauchy’s inequality in a more efficient way than in (1.29), to avoid losing the
factor $\log T$. The idea is to first apply Cauchy’s inequality to separate the values of $b$ with
$B = T^{1/2} (\log T)^{O(1)}$ from the rest. These other values of $B$ still retain the factor $\log T$ (which
comes from counting the number of dyadic values of $B$) in (1.29), but we are able to save
a large power of $\log T$ to more than compensate for this. Since there are only $O(\log\log T)$
values of $B$ with $B = T^{1/2} (\log T)^{O(1)}$, we can essentially replace (1.29) with

(1.31) $\int_{|t - T| \asymp \Delta} \left| \sum_{n \asymp N} \frac{\tau_{iT}(n) e(nx)}{n^{1/2+it}} \right|^2 dt \ll \log\log T \sum_{\substack{B = T^{1/2} (\log T)^{O(1)} \\ B \text{ dyadic}}} \int_{|t - T| \asymp \Delta} \left| \sum_{a \asymp A} \sum_{b \asymp B} \frac{e(abx)}{a^{1/2+it-iT}\, b^{1/2+it+iT}} \right|^2 dt + O(\Delta (\log T)^{-10}).$


In addition, the Poisson summation arguments alluded to following (1.29) show

(1.32) $\frac{1}{N} \int_{|t - T| \asymp \Delta} \left| \sum_{a \asymp A} \sum_{b \asymp B} \frac{e(abx)}{a^{1/2+it-iT}\, b^{1/2+it+iT}} \right|^2 dt \ll (\log\log T) \min\left(\frac{B^2}{T}, \frac{T}{B^2}\right),$

which is $O(\log\log T)$ even after summing over dyadic values of $B$. This method does not
recover the factor $|\zeta(1 + 2iT)|^2$, and leads to (1.10).

One may also attempt to estimate (1.28) with absolute values, as in Holowinsky’s work.
Although this is an intriguing possibility, I have been unable to improve on Theorem
1.2 (there is no hope to prove Conjecture 1.3 this way) using this method. See Section 7.2
for further discussion on this matter.

1.3. Acknowledgements. I thank Junehyuk Jung and Eren Mehmet Kiral for many conversations
on this work, and Peter Sarnak for a question that motivated this paper. Much
of this work was performed while the author was a member at the Institute for Advanced 
Study in Fall 2014. 


2. Notation and initial developments 


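2.1. Notation. Recall $V_T(y) = \sqrt{y} K_{iT}(y)$. By a well-known formula for the Mellin transform
of the $K$-Bessel function, we have

(2.1) $\gamma_{V_T}(1/2 + s) := \int_0^\infty V_T(2\pi y) y^s \frac{dy}{y} = 2^{-3/2} \pi^{-s}\, \Gamma\left(\frac{1/2 + s + iT}{2}\right) \Gamma\left(\frac{1/2 + s - iT}{2}\right).$

It is also helpful to recall

(2.2) $\gamma_{V_T^2}(1 + s) := \int_0^\infty V_T(2\pi y)^2 y^s \frac{dy}{y} = 2^{-2} \pi^{-s}\, \frac{\Gamma\left(\frac{1+s+2iT}{2}\right) \Gamma\left(\frac{1+s}{2}\right)^2 \Gamma\left(\frac{1+s-2iT}{2}\right)}{\Gamma(1 + s)},$

and

(2.3) $Z(s, E_T) := \sum_{n=1}^{\infty} \frac{\tau_{iT}(n)^2}{n^s} = \frac{\zeta(s - 2iT) \zeta(s + 2iT) \zeta(s)^2}{\zeta(2s)}.$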

2.2. Miscellaneous lemmas. Here we collect some basic tools used throughout this paper. 


Lemma 2.1 (Vinogradov-Korobov). For some $c > 0$ and for any $|t| \gg 1$, $1 - \frac{c}{(\log |t|)^{2/3}} \leq
\sigma \leq 1$, we have

(2.4) $\zeta(\sigma + it) \ll (\log |t|)^{2/3},$

and

(2.5) $\frac{\zeta'}{\zeta}(\sigma + it) \ll (\log |t|)^{2/3+\varepsilon}.$

For a reference, see [IK, Corollary 8.28, Theorem 8.29].

We also need the mean value theorem for Dirichlet polynomials of Montgomery and
Vaughan [MV].


Lemma 2.2 (Montgomery-Vaughan). Let $a_n$ be an arbitrary complex sequence. We have

(2.6) $\int_0^U \left| \sum_{n \leq N} a_n n^{-iu} \right|^2 du = \sum_{n \leq N} |a_n|^2 (U + O(n)).$
n<N 
2.3. Initial developments. We will calculate $F_{x,T}(s) = F(s)$ here, and then perform some
easy approximations. Inserting (1.1) into (1.13), using (2.1) and the Mellin inversion formula
$\psi(y) = \frac{1}{2\pi i} \int_{(1)} \tilde{\psi}(-u) y^{u} \,du$, we obtain

(2.7) $F(s) = \mu \tilde{\psi}(1/2 + s + iT) + \overline{\mu} \tilde{\psi}(1/2 + s - iT) + \frac{\rho^*(1)}{(2\pi)^s} \sum_{n \neq 0} \frac{\tau_{iT}(n) e(nx)}{|n|^{1/2+s}} \frac{1}{2\pi i} \int_{(1)} \tilde{\psi}(-u) |n|^{-u} \gamma_{V_T}(1/2 + s + u) \,du.$

Next we develop some simple approximations. First, note that

(2.8) $\frac{|\rho^*(1)|^2}{\cosh(\pi T)} = \frac{2/\pi}{|\zeta(1 + 2iT)|^2}.$

By (2.1), combined with (2.8), we have that the integral appearing in (2.7) is exponentially
small if $|\mathrm{Im}(s + u)| - T \gg \log^2 T$. Since $\tilde{\psi}$ has rapid decay in the imaginary direction, we
have that $F(it) \ll T^{-100}$, say, for $|t| \geq T + T^{\varepsilon}$.

The size of $\mathrm{Im}(s)$ is the most important basic parameter in the analysis of $I_\psi(x, T)$.

We can simplify $F(s)$ in the range $|\mathrm{Im}(s)| \leq T - T^{\varepsilon}$. Let

(2.9) $F_0(s) = \rho^*(1) \gamma_{V_T}(1/2 + s) \sum_{n \neq 0} \frac{\tau_{iT}(n) e(nx)}{|n|^{1/2+s}} \psi\left(\frac{\sqrt{|s + iT||s - iT|}}{2\pi |n|}\right).$

Lemma 2.3. For each $\varepsilon > 0$ there exists $\delta > 0$ so that

(2.10) $\int_{|t| \leq T - T^{\varepsilon}} |F(it)|^2 \,dt = \int_{|t| \leq T - T^{\varepsilon}} |F_0(it)|^2 \,dt + O(T^{-\delta}).$

Note that if $s = it$, $0 \leq t \leq T$, we have that $\psi$ appearing in (2.9) is supported on
$|n| \asymp T^{1/2}(T - t)^{1/2}$.


Proof. In the range $|t| \leq T - T^{\varepsilon}$, Stirling gives

(2.11) $\Gamma\left(\frac{1/2 + s \pm iT + u}{2}\right) \sim \Gamma\left(\frac{1/2 + s \pm iT}{2}\right) \left(\frac{1/2 + s \pm iT}{2}\right)^{u/2} \left(1 + \sum_{j \geq 1} \frac{P_{j,\pm}(u)}{(s \pm iT)^j}\right),$

where $P_{j,\pm}$ is a polynomial, and where the meaning of $\sim$ here is as an asymptotic expansion.
Therefore, since $|s \pm iT| \gg T^{\varepsilon}$, we can truncate the expansion at $j \asymp \varepsilon^{-2}$ with a very small
error term. By (2.1), we have

(2.12) $\gamma_{V_T}(1/2 + s + u) \sim (2\pi)^{-u} |s + iT|^{u/2} |s - iT|^{u/2}\, \gamma_{V_T}(1/2 + s) \prod_{\pm} \left(1 + \sum_{j \geq 1} \frac{P_{j,\pm}(u)}{(s \pm iT)^j}\right).$

Therefore, for $s = it$, $|t| \leq T - T^{\varepsilon}$,

(2.13) $F(s) = \rho^*(1) \gamma_{V_T}(1/2 + s) \sum_{n \neq 0} \frac{\tau_{iT}(n) e(nx)}{|n|^{1/2+s}} \frac{1}{2\pi i} \int_{(1)} \tilde{\psi}(-u) \left(\frac{\sqrt{|s + iT||s - iT|}}{2\pi |n|}\right)^{u} \prod_{\pm} \left(1 + \sum_{1 \leq j \leq \varepsilon^{-2}} \frac{P_{j,\pm}(u)}{(s \pm iT)^j}\right) du + O(T^{-100}).$

The leading term in the $u$-integral is calculated exactly via (2.9), so that

(2.14) $F(s) = \sum_{0 \leq l \leq \varepsilon^{-2}} F_l(s) + O(T^{-100}),$

say. The lower-order terms in the asymptotic expansion with $l \geq 1$ involve linear combinations
of derivatives of $\psi$ in place of $\psi$, and multiplied by factors of the form $(s + iT)^{-j_1} (s -
iT)^{-j_2}$, and are hence all of the same essential shape yet smaller by these additional factors.
In fact, one can show that $\int_{|t| \leq T - T^{\varepsilon}} |F_0(it)|^2 \,dt \ll T^{\varepsilon}$, using only the mean value theorem
for Dirichlet polynomials (Lemma 2.2), and by similar reasoning, for $j \geq 1$, we have
$\int_{|t| \leq T - T^{\varepsilon}} |F_j(it)|^2 \,dt \ll T^{-\delta}$, for some $\delta > 0$. □


Next we smoothly decompose the $t$-integral into pieces of the form $|t \mp T| \asymp \Delta$ where
$1 \ll \Delta \ll T$. It will also be necessary to treat the integral of $|F(it)|^2$ with $|t \mp T| \ll T^{\varepsilon}$ for
which see Section El below. As a first-order approximation, we mention that for such t, we 
have 

7 

(2.15) t k — cosh(7 tT)1 7 y T (1/2 + it) | 2 « (AT) -1 / 2 , k = 0,1, 2,.... 

For the purposes of calculating diagonal terms, it is convenient to record the following

Lemma 2.4. Let $w$ be a fixed smooth, compactly-supported function. Suppose $\log N \gg
(\log T)^{2/3+\delta}$, for some fixed $\delta > 0$. Then

(2.16) $\sum_{n \geq 1} |\tau_{iT}(n)|^2 w\left(\frac{n}{N}\right) = \frac{|\zeta(1 + 2iT)|^2}{\zeta(2)} \left[\tilde{w}(1) N \log N + O(N (\log T)^{2/3+\varepsilon})\right].$


Proof. By Mellin inversion and (2.3), the sum on the left hand side of (2.16) is

(2.17) $\frac{1}{2\pi i} \int_{(2)} N^s \tilde{w}(s) \frac{\zeta(s - 2iT) \zeta(s + 2iT) \zeta(s)^2}{\zeta(2s)} \,ds.$

We evaluate the integral by the standard method of moving the contour, in this case to
$\mathrm{Re}(s) = \sigma = 1 - \frac{c}{(\log T)^{2/3}}$, for some constant $c > 0$. We first calculate the residue at $s = 1$.
Using

(2.18) $N^{s-1} = 1 + (s - 1) \log N + O((s - 1)^2),$
$\zeta(s - 2iT) \zeta(s + 2iT) = |\zeta(1 + 2iT)|^2 \left(1 + (s - 1)\, 2\,\mathrm{Re}\, \frac{\zeta'}{\zeta}(1 + 2iT) + O((s - 1)^2)\right),$

and the Vinogradov-Korobov bound (Lemma 2.1), one calculates that the residue at $s = 1$
is consistent with the right hand side of (2.16). The residues at $s = 1 \pm 2iT$ are negligible
because $\tilde{w}$ has rapid decay in the imaginary direction.

Finally we bound the integral along the line $\sigma$. We use the Vinogradov-Korobov bound
again, which leads to an error of size $N (\log T)^{4/3} \exp\left(-c \frac{\log N}{(\log T)^{2/3}}\right)$, which is bounded by the
error term appearing in (2.16). □

For ease of use, we also directly quote the following. 
Proposition 2.5 ([Y], Theorem 8.1). Suppose that $w(x)$ is a smooth function on the positive
reals supported on $Y \leq x \leq 2Y$ and satisfying $w^{(j)}(x) \ll_j (P/Y)^j$ for some parameters
1 < P <Y. Let 6 = 7/64, and set R = P + -^1. Then for m/0, T/(TY) 5 , we have 

(2.19) T iT{ri)Ti T {n + m)w(n ) = M.T. + E.T ., 
where 

(2.20) M.T. = y] ) f (x + m) TlT x ±lT w(x)dx, 

_l_ C(4) i max(0 ,-m) 

and 

(2.21) E.T. < (Iml^y^i? 2 + tWr})(TY) £ . 

Furthermore, with R = P + yr-, we have 

(2.22) V l-E.T.I < {MtW^R 2 + MtWr^)(TY) e . 

l<\m\<M 

Remark. The bound (2.22) roughly means that the Ramanujan-Petersson conjecture, i.e.,
$\theta = 0$, holds on average over $m$.


3. The largest $\Delta$ regime: extracting the main term
The purpose of this section is to show the following 
Proposition 3.1. Let 6 > rj(T) (logT)"* 5 for some fixed small 5 > 0. Then 


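(3.1) $\int_{|t| \leq T - \eta(T)T} |F_0(it)|^2 \frac{dt}{2\pi} = \frac{3}{\pi} \log(1/4 + T^2) \int_0^\infty \psi^2(y) \frac{dy}{y} + R(x, T) + O(\eta(T)^{1/2} \log T + |\log \eta(T)|),$

where $R(x, T)$ is a term satisfying, for any $U \geq 1$,

(3.2) $\int_{|x - x_0| \leq \frac{1}{U}} R(x, T) \,dx \ll \frac{\log T}{\eta^{2}(T)\, T}.$

Furthermore, under the same conditions,

(3.3) $\int_{|t| \leq T - \eta(T)T} |F(it)|^2 \,dt \ll \log T.$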

This accounts for the main bulk range of $t$’s, which provide a main term as well as an
additional term (this is $R(x, T)$, which we can show is small with a tiny extra $x$-averaging).
From Proposition 3.1 we may deduce Theorem 1.1 as follows. We have


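(3.4) $I_{\psi,\gamma}(x_0, T) = \int_{|x - x_0| \leq \frac{\gamma}{T}} \int_{-\infty}^{\infty} |F_{x,T}(it)|^2 \frac{dt}{2\pi} \,dx \geq \int_{|x - x_0| \leq \frac{\gamma}{T}} \int_{|t| \leq T - \eta(T)T} |F_{x,T}(it)|^2 \frac{dt}{2\pi} \,dx$
$= \frac{3}{\pi} \log(1/4 + T^2)\, m_{\psi,\gamma} \left(1 + O\left(\eta^{1/2} + \frac{\log\log T}{\log T}\right)\right) + O\left(\int_{|x - x_0| \leq \frac{\gamma}{T}} R(x, T) \,dx\right).$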

The bound (3.2) implies $\int_{|x - x_0| \leq \frac{\gamma}{T}} R(x, T) \,dx \ll m_{\psi,\gamma} (\log T)(\gamma \eta^2)^{-1}$, recalling that $m_{\psi,\gamma} \asymp
\gamma/T$. Letting $\gamma \to \infty$ very slowly (in particular, we may assume $\gamma \ll (\log T)^{\delta}$, some $\delta > 0$),
and then setting $\eta = \gamma^{-2/5}$, we conclude the proof. In fact, this argument exhibits the error
term in Theorem 1.4, but to fully prove Theorem 1.4 we need to study $T^{25/27+\varepsilon} \ll |t \mp T| \leq \eta T$
also.

Proof of Proposition 3.1. For shorthand, set $\eta = \eta(T)$. Let $W(t)$ be a function that is $1$ on
$|t| \leq T - c_1 \eta T$, and $0$ on $|t| \geq T - c_2 \eta T$, where here $c_1 > c_2 > 0$ are positive constants to be
chosen later, and satisfying $W^{(j)}(t) \ll (\eta T)^{-j}$.
We have 


(3.5) 

where 

(3.6) 


W(t)\F 0 (it)\ 2 dt = 


1 p*(!)1 2 

cosh( 7 rT) 


£- 


T iT (m)T iT {n)e((n - m)x) 


\mn 


11/2 


J(m, n ), 


J(m,n) = / W(f)|7v T (l/2 + it)\ 2 cosh(TrT)^ 

J — OO 

In the right hand side of (3.5), write


\l\r rr -r\\^( PP - r\ 


2lT 77 


27r m 


\m\ 

Ini 


dt. 


(3.7) 


W(t)\F„(it)f = M.T. + E(x,T) + R(x,T), 


where M.T. corresponds to the terms with $m = n$ (both positive and negative), $R(x, T)$
corresponds to the terms with $m$ and $n$ of opposite signs, and $E(x, T)$ are the terms with $m$
and $n$ of the same sign, and $m \neq n$. We will show that


(3.8) 


M.T. = - log(l/4 + T 2 ) ( /“V 2 (»)- + 0(>; 1/2 )). 
7r V /n y / 


that $R(x, T)$ satisfies (3.2), and that

(3.9) $E(x, T) \ll |\log \eta|.$

At this point we can also explain how to remove the smoothing factor W. We have that 


(3.10) 


'\t\<T-r/T 


\F 0 (it)\ 2 dt< / W(t)\F 0 (it)\ 2 + / (W+(t)~ W(t))\F 0 m 2 dt : 


' —OO 

poo 


' —00 
poo 


\F 0 (it)\ 2 dt > 


(W_(t)-W{t))\F 0 (it)\ 2 dt, 


W (t)\F 0 (it)\ 2 + 

J\t\ <T—r)T J— 00 J— 00 

where $W_+$ and $W_-$ satisfy the same properties as $W$, but with different choices of constants
$c_i$, and such that $W_+ \geq W \geq W_-$. We use the decomposition (3.7) to treat the term
$\int W(t)|F_0(it)|^2 \,dt$ appearing on each line above. The second integral appearing on each
line we shall treat slightly differently, as follows. Instead of using the formula (3.5) (but
with $W$ replaced by $|W - W_\pm|$), we first apply Cauchy’s inequality in the form
$|\sum_{n \neq 0} a_n|^2 \leq 2|\sum_{n \geq 1} a_n|^2 + 2|\sum_{n \leq -1} a_n|^2$. After this, we then open the square and perform the integral.
The same estimates on $E(x, T)$ and M.T. apply (in fact the “main term” here is $O(\eta^{1/2} \log T)$
since the right hand side of (3.8) does not depend on $W$), and the gain is that the opposite
sign terms do not appear. That is, (3.8) and (3.9) imply

(3.11) $\int_{-\infty}^{\infty} |W_\pm(t) - W(t)| |F_0(it)|^2 \,dt \ll \eta^{1/2} \log T + |\log \eta|.$
The same idea here of using Cauchy’s inequality shows $\int W(t)|F_0(it)|^2 \,dt \ll \log T +
E(x, T)$, which by (3.9) implies (3.3).

Now we analyze $J(m, n)$. Note that $m$ and $n$ are supported on

(3.12) $\eta^{1/2} T \ll |m|, |n| \ll T.$

By Stirling’s approximation (see (2.15)), we have that $J(m, n) = \int_{-\infty}^{\infty} j(m, n, t, T) (|m|/|n|)^{it} \,dt$,
where $j$ is a function satisfying $\frac{\partial^k}{\partial t^k} j(m, n, t, T) \ll \eta^{-1/2}\, T^{-1} (\eta T)^{-k}$. Thus by standard
integration by parts, if $|m| \neq |n|$,

(3.13) $J(m, n) \ll \frac{\eta^{-1/2}}{[\eta T\, |\log(|m|/|n|)|]^k}, \quad k = 0, 1, 2, \ldots.$

To get our bearings, we shall check the size of the trivial bound on the off-diagonal terms
with $|m| \neq |n|$ (it will turn out to be just barely insufficient). For this we use the simple
bound $|xy| \leq |x|^2 + |y|^2$, (3.13), and symmetry, giving


(3.14) 

E 


m^n 

where h = 

\m\ — 

is 


(3.15) 



\T iT {m)TiT{n)\ 


\mn 


11/2 


|J(m,n)|< Y 


r 1 1 / 2 T<^\m\,\n\<.T 

rriy^n 


I r iT (m)\ 2 77 

71 1 yT\h\ W' 

T ^ \m\ > 


\m\ 


<rr 3/2 T -! |r iT (u)| 2 < 77 - 3 / 2 |C(l + 2iT)| 2 logT. 

|n|<T 


Recalling (2.8), in all we have shown

|p*(i)l 2 


(3.16) 


cosh( 7 rT) 


E 


IrtMrtWI 


r) 1 / 2 T<|m|,|n|<T 


77777 


11/2 


| J(m, n)\ <C 77 3 /, 2 logT. 


We will provide an improved estimate on the off-diagonal terms which will give (3.9), but
prior to that we bound $R(x, T)$ and evaluate the main term.

Recall that $R(x, T)$ denotes the terms on the right hand side of (3.5) such that $m$ and $n$
have opposite signs. Further write $R(x, T) = R'(x, T) + R_0(x, T)$ corresponding to the terms
with $|m| \neq |n|$ and $|m| = |n|$, respectively. Then changing variables and using symmetry, we
have


(3.17) 


R'(x, T)= 2 


M. 

cosh( 7 rT) 


Re Y 


Tir(m)Ti T (n)e((m + n)x) 


m^n> 0 


\mn 


11/2 


J (m, n) 


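The bound (3.16) obviously shows that $R'(x,T) \ll \eta^{-3/2} \log T$, and so we need a small savings in the $x$-integral. For this, one may directly calculate that for any $U > 0$ and any nonzero integer $n$,

(3.18)  $\Big| \int_{|x - x_0| \leq U} e(nx)\,dx \Big| \leq \frac{1}{\pi |n|},$

so, recalling from (3.12) that the frequencies $m + n$ appearing in (3.17) are $\gg \eta^{1/2} T$, we obtain (3.2) for $R'$, precisely,

(3.19)  $\int_{|x - x_0| \leq U} R'(x,T)\,dx \ll \frac{\log T}{\eta^2 T}.$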
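The elementary bound (3.18) can be checked numerically. The following is a minimal sketch, not taken from the paper: the function names are hypothetical, and the midpoint rule is used only as an independent check of the closed form $\int_{|x-x_0| \leq U} e(nx)\,dx = (e(n(x_0+U)) - e(n(x_0-U)))/(2\pi i n)$, whose modulus is at most $1/(\pi|n|)$ regardless of $U$ and $x_0$.

```python
import cmath
import math


def e(z: float) -> complex:
    """The additive character e(z) = exp(2*pi*i*z)."""
    return cmath.exp(2j * math.pi * z)


def exp_integral_closed(n: int, x0: float, U: float) -> complex:
    """Closed form of the integral of e(n*x) over |x - x0| <= U, for n != 0."""
    return (e(n * (x0 + U)) - e(n * (x0 - U))) / (2j * math.pi * n)


def exp_integral_midpoint(n: int, x0: float, U: float, steps: int = 20000) -> complex:
    """Midpoint-rule approximation of the same integral (independent check)."""
    h = 2.0 * U / steps
    return h * sum(e(n * (x0 - U + (k + 0.5) * h)) for k in range(steps))


def bound(n: int) -> float:
    """Right hand side of (3.18): 1 / (pi * |n|)."""
    return 1.0 / (math.pi * abs(n))
```

The point of the bound is that it does not depend on $U$: the saving comes entirely from the size of the frequency $n$.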
Next we evaluate M.T. defined by

(3.20)  $\text{M.T.} = \frac{|\rho^*(1)|^2}{2\pi} \int_{-\infty}^{\infty} W(t) |\gamma_{V_T}(1/2+it)|^2 \sum_{n \neq 0} \frac{|\tau_{iT}(n)|^2}{|n|} \psi^2\Big(\frac{\sqrt{T^2-t^2}}{2\pi |n|}\Big)\,dt.$

We need a slight generalization of Lemma 2.4, so we proceed directly. Letting $\Psi = \psi^2$, we have that M.T. equals

(3.21)  $\frac{2 |\rho^*(1)|^2}{2\pi} \int_{-\infty}^{\infty} W(t) |\gamma_{V_T}(1/2+it)|^2 \frac{1}{2\pi i} \int \widetilde{\Psi}(-u) (T^2-t^2)^{u/2} (2\pi)^{-u} Z(1+u, E_T)\,du\,dt.$

We shift the contour to the left, to $\mathrm{Re}(u) = -1/2$. On the new contour we get a bound of the form $T^{-1/2+1/3+\varepsilon}$, by Weyl's subconvexity bound. By a similar computation as in Lemma 2.4 (in the proof it is helpful to remember $|t - T| \gg \eta T$ from the support of $W$, and $\eta \gg (\log T)^{-\delta}$), the residue at $u = 0$ gives

(3.22)  $\frac{|\rho^*(1)|^2 |\zeta(1+2iT)|^2}{\pi \zeta(2)} \Big[ (\log T) \widetilde{\Psi}(0) + O((\log T)^{2/3+\varepsilon}) \Big] \int_{-\infty}^{\infty} W(t) |\gamma_{V_T}(1/2+it)|^2\,dt.$

Note $\widetilde{\Psi}(0) = \int_0^\infty \psi^2(y) \frac{dy}{y}$. We also claim

(3.23)  $\frac{1}{2\pi} \int_{-\infty}^{\infty} W(t) |\gamma_{V_T}(1/2+it)|^2\,dt = \frac{1}{2\pi} \int_{-\infty}^{\infty} |\gamma_{V_T}(1/2+it)|^2\,dt + O(\eta^{1/2} \exp(-\pi T)).$

This follows from the estimates

(3.24)  $\int_{T - \eta T}^{T} T^{-1/2} (1 + |t - T|)^{-1/2}\,dt \ll \eta^{1/2},$

and

(3.25)  $\int_{T}^{\infty} \cosh(\pi T) |\gamma_{V_T}(1/2+it)|^2\,dt \ll T^{-1/2},$

using (2.15). To calculate the constant on the right hand side of (3.22), we first recall the Mellin formula

(3.26)  $\frac{1}{2\pi} \int_{-\infty}^{\infty} |\gamma_{V_T}(1/2+it)|^2\,dt = \gamma_{V_T^2}(1).$


One then easily checks, using (2.2) and the formulas following (1.1), that

(3.27) = -• 

C(2) vr 


Thus we have shown (3.8).

Finally, we consider the "pseudo"-main term that arises from $|m| = |n|$ but with $m = -n$, which we denoted $R_0(x,T)$. A similar computation to that for M.T. shows that $R_0(x,T) \ll \log T$, and (3.2) holds for $R_0$. Thus, (3.2) holds for $R = R_0 + R'$.

To complete the proof of Proposition 3.1, we need an improved bound (compared to (3.16)) on the off-diagonal terms, $E(x,T)$. It suffices to bound the terms with both $m, n > 0$, the case where both $m, n < 0$ following from this by symmetry. We shall use Proposition 2.5 to treat the terms with $m = n + h$, $h \neq 0$. Technically to meet the hypotheses of the proposition we need a weight function supported on a dyadic interval, so we apply a dyadic partition of unity to the off-diagonal positive values of $m, n$ on the right hand side of (3.5). Taking one such part of the partition with say $n \asymp N$, we get that $E(x,T)$ is a sum (over $N$) of terms of the form

(3.28) S(x, N,T) : = |C(1 + 2fT)| 2 TV^/ur) Ti T (n)T iT {n + h)g(h,n), 

0<|/i|<T e n 


where $g$ is a function satisfying

(3.29)  $\frac{\partial^k}{\partial u^k} g(h,u) \ll \frac{N}{T} \Big(1 + \frac{N|h|}{T}\Big)^{-100} N^{-k}.$

Here this estimate arises by a slight modification of (3.13), using now that if $|n| \asymp N$, then $|t - T| \asymp N^2/T$. By Proposition 2.5 and its following remark, $S(x,N,T)$ equals a main term plus an error term of size $O(T^{-1/12+\varepsilon})$ (here we use that $N = T^{1+o(1)}$). The main term takes the form


(3.30) 


£l « 1 + 2 ir )l 2 £ n - 1 

± 0<|/i|<T s 


'max(0 ,—h) 


(r + h) ±lT r TiT g(h, r)dr , 


which by trivial estimations using (13 .29 ft is |C(1 + 2 iT) 2 . Since the number of dyadic 
segments is $O(|\log \eta|)$, we have thus shown (3.9). We will give a more elaborate analysis of this off-diagonal term following (4.23) below. □


4. Large $\Delta$ regime via shifted divisor sum

Complementing Proposition 3.1, we have

Proposition 4.1. Suppose that $\delta > \eta(T) \gg (\log T)^{-\delta}$ for some fixed small $\delta > 0$, and

(4.1)  $T^{25/27+\varepsilon} \ll \Delta \leq \eta(T) T.$

Then for some $\delta' > 0$, we have

(4.2)  $\int_{\frac12 \Delta \leq |t \mp T| \leq \Delta} |F_0(it)|^2\,dt \ll \frac{\Delta^{1/2}}{T^{1/2}} \log T + T^{-\delta'} + S_\Delta,$

where $S_\Delta$ is a nonnegative quantity satisfying

(4.3)  $\sum_{\substack{1 \ll \Delta \ll T \\ \text{dyadic}}} S_\Delta \ll 1.$

Remark. There is a possibility to allow somewhat smaller values of $\Delta$ by exploiting additional savings in the sum over $h$. However, the improvement is modest and the work is technical, so we have omitted this line of discussion.

Combining Proposition 3.1 and its following discussion with Proposition 4.1, we derive

(4.4) f I |F 0 (^) | 2 ^ =-log(l/4 +T 2 )m^, 7 

J\t\<T- C(e)T 25 /27+e J\x-x 0 \<X 27T 7T 



Recall $m_\gamma \asymp \gamma$. Using (3.2), and choosing $\eta = \gamma^{-2/5}$, we derive Theorem 1.4. There is one technical point that was glossed over in the above proof, which is the restriction that $\eta \gg (\log T)^{-\delta}$ for some fixed $\delta > 0$. This will hold under the additional assumption that $\gamma \ll (\log T)^{\delta'}$ for some fixed small $\delta' > 0$. Once Theorem 1.4 is proved with this restriction on $\gamma$, it then applies without the restriction by additivity of the integral and the fact that the estimate is uniform in $x$, so that a larger interval can be decomposed as a union of smaller ones (each of length $\ll (\log T)^{\delta'}$).

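We begin with some simplifications of a general nature.

Lemma 4.2. Suppose that $T^{\varepsilon} \ll \Delta \leq \eta(T) T$, where $\eta(T) > 0$ is sufficiently small. Then

(4.5)  $\int_{\frac12 \Delta \leq |t \mp T| \leq \Delta} |F_0(it)|^2\,dt \ll \frac{|\zeta(1+2iT)|^{-2}}{\sqrt{\Delta T}} \max_{\pm x} I(\Delta, T, \pm x, N) + T^{-100},$

where

(4.6)  $I(\Delta, T, x, N) = \int_{-\infty}^{\infty} w_1\Big(\frac{T - t}{\Delta}\Big) \Big| \sum_{n \geq 1} \frac{\tau_{iT}(n) e(nx)}{n^{1/2+it}} w_2\Big(\frac{n}{N}\Big) \Big|^2 dt, \qquad N = \sqrt{\Delta T},$

and $w_1, w_2$ are certain fixed smooth weight functions with compact support on the positive reals.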

Proof of Lemma 4.2. The essential part of the proof is a separation of variables argument. We will need variations on this argument in other parts of the paper, so we provide full details here.

By symmetry, we may consider the portion of the integral with $t > 0$. Using Stirling's approximation (2.15), and Cauchy's inequality in the form $|\sum_{n \neq 0} a_n|^2 \leq 2 |\sum_{n > 0} a_n|^2 + 2 |\sum_{n > 0} a_{-n}|^2$, we obtain

(4.7)  $\int_{\frac12 \Delta \leq |t - T| \leq \Delta} |F_0(it)|^2\,dt \ll \frac{|\zeta(1+2iT)|^{-2}}{\sqrt{\Delta T}} \max_{\pm x} \int_{\frac12 \Delta \leq |t - T| \leq \Delta} \Big| \sum_{n \geq 1} \frac{\tau_{iT}(n) e(nx)}{n^{1/2+it}} \psi\Big(\frac{\sqrt{T^2 - t^2}}{2\pi n}\Big) \Big|^2 dt.$


The support of $\psi$ means that the $n$-sum is supported on $n \asymp N$, where

(4.8)  $N = \sqrt{\Delta T}.$

This integral is not yet quite in the form $I(\Delta, T, x, N)$. For this, we multiply $\psi$ by a redundant factor $w_2(n/N)$, such that $w_2(n/N) = 1$ for all $n$ in the support of $\psi$, yet such that $w_2$ is smooth of compact support. Then we use $\psi(y) = \frac{1}{2\pi i} \int_{(0)} \widetilde{\psi}(-u) y^u\,du$, and apply Cauchy-Schwarz to the $u$-integral, obtaining

(4.9)  $\Big| \sum_n a_n \psi\Big(\frac{\sqrt{T^2 - t^2}}{2\pi n}\Big) \Big|^2 \ll \int_{-\infty}^{\infty} |\widetilde{\psi}(-iu)| \Big| \sum_n a_n n^{-iu} \Big|^2 du,$

using the fact that $\int_{-\infty}^{\infty} |\widetilde{\psi}(iv)|\,dv \ll 1$ (the reader may recall that $\psi$ was fixed at the beginning of the paper and is independent of all relevant parameters).
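In our application, we may truncate the $u$-integral at $|u| \leq \frac14 \Delta$, leading to an error that is at most $O(T^{-100})$, using the rapid decay of $\widetilde{\psi}$, and the fact that $\Delta \gg T^{\varepsilon}$. That is,

(4.10)  $\int_{\frac12 \Delta \leq |t - T| \leq \Delta} \Big| \sum_{n \geq 1} \frac{\tau_{iT}(n) e(nx)}{n^{1/2+it}} \psi\Big(\frac{\sqrt{T^2 - t^2}}{2\pi n}\Big) \Big|^2 dt \ll \int_{\frac12 \Delta \leq |t - T| \leq \Delta} \int_{|u| \leq \frac14 \Delta} |\widetilde{\psi}(-iu)| \Big| \sum_{n \geq 1} \frac{\tau_{iT}(n) e(nx)}{n^{1/2+it+iu}} w_2\Big(\frac{n}{N}\Big) \Big|^2 dt\,du + O(T^{-100}).$

Next we change variables $t \to t - u$ and over-extend the $t$-range of integration to $\frac14 \Delta \leq |t - T| \leq \frac54 \Delta$. In this way the variables $u$ and $t$ are separated, and the $u$-integral can be bounded again using $\int |\widetilde{\psi}(iv)|\,dv \ll 1$.

Finally, by attaching a smooth weight function $w_1\big(\frac{T - t}{\Delta}\big)$ we obtain the desired result. □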

Proof of Proposition 4.1. The basic approach is similar to that of Proposition 3.1, in that we shall solve a shifted divisor problem.

In light of Lemma 4.2, we need to show

(4.11)  $I(\Delta, T, x, N) \ll |\zeta(1+2iT)|^2 (\Delta \log T + \Delta T^{-\delta'} + N S_\Delta),$

for $\Delta \gg T^{25/27+\varepsilon}$, and some $\delta' > 0$. Opening the square and integrating in $t$, we obtain

(4.12)  $I(\Delta, T, x, N) = \sum_{m, n \geq 1} \frac{\tau_{iT}(m) \tau_{iT}(n) e((m-n)x)}{\sqrt{mn}} w_2\Big(\frac{m}{N}\Big) w_2\Big(\frac{n}{N}\Big) \int_{-\infty}^{\infty} w_1\Big(\frac{T - t}{\Delta}\Big) \Big(\frac{n}{m}\Big)^{it} dt.$

By Lemma 2.4, we have that the terms with $m = n$ contribute to $I(\Delta, T, x, N)$ an amount

(4.13)  $\ll \Delta |\zeta(1+2iT)|^2 \log T,$

which leads to the first bound in (4.11).

Our main job is to bound the terms with $m \neq n$, in which case we set $m = n + h$. By simple estimations, we then have that these terms contribute to $I(\Delta, T, x, N)$ an amount at most

(4.14)  $\frac{\Delta}{N} \sum_{h \neq 0} e(hx) \sum_n \tau_{iT}(n) \tau_{iT}(n+h) K(n,h),$

where

(4.15)  $K(n,h) = \frac{w_2(n/N)\, w_2((n+h)/N)}{(n/N)^{1/2} ((n+h)/N)^{1/2}} \Big(\frac{n+h}{n}\Big)^{-iT} \widehat{w}_1\Big(\frac{\Delta}{2\pi} \log \frac{n+h}{n}\Big).$

In particular, $K(n,h)$ is supported on $n \asymp N$ and satisfies


(4.16) 


<T . / A In I \ 

%«)«FA-dl + 


ch/4 


with A > 0 arbitrary, and where 
(4.17) 


A 


A = A T. 


j =0,1,2,..., 
Proposition 2.5 gives

(4.18) ^^e(hx)(^T iT {n)T iT {n + h)K(n,h) - M.T.'j <C T+ e N^R 2 + T* +e N*R*, 

h^O n 

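where M.T. is a main term (off-diagonal) to be discussed presently. A short calculation shows that the two error terms in (4.18) are $O((\Delta T)^{1/2} T^{-\delta})$ for $\Delta \gg T^{25/27+\varepsilon}$. It turns out that the main terms are surprisingly difficult to estimate with uniformity in $x$, and we next turn to this problem.

The off-diagonal main term is defined by

(4.19)  $\text{M.T.} = \sum_{\pm} \frac{|\zeta(1+2iT)|^2}{\zeta(2)} \sigma_{-1}(h) \int_0^\infty (y+h)^{\mp iT} y^{\pm iT} K(y,h)\,dy.$

These two terms then give to $I(\Delta, T, x, N)$ an amount

(4.20)  $\frac{|\zeta(1+2iT)|^2}{\zeta(2)} \frac{\Delta}{N} \sum_{\pm} \sum_{h \neq 0} e(hx) \sigma_{-1}(h) \int_0^\infty \frac{w_2(y/N)\, w_2((y+h)/N)}{(y/N)^{1/2} ((y+h)/N)^{1/2}} \Big(\frac{y+h}{y}\Big)^{-iT \mp iT} \widehat{w}_1\Big(\frac{\Delta}{2\pi} \log \frac{y+h}{y}\Big)\,dy.$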


When the choice of sign makes the exponent of $(y+h)/y$ be $-2iT$, then a simple integration by parts argument shows that

(4.21)  $\int_0^\infty \frac{w_2(y/N)\, w_2((y+h)/N)}{(y/N)^{1/2} ((y+h)/N)^{1/2}} \Big(\frac{y+h}{y}\Big)^{-2iT} \widehat{w}_1\Big(\frac{\Delta}{2\pi} \log \frac{y+h}{y}\Big)\,dy \ll N \Big(1 + \frac{|h| T}{N}\Big)^{-10}.$

Hence this off-diagonal term contributes to $I(\Delta, T, x, N)$ an amount that is

(4.22)  $\ll |\zeta(1+2iT)|^2 \frac{\Delta}{N} \sum_{h \neq 0} \sigma_{-1}(h)\, N \Big(1 + \frac{|h| T}{N}\Big)^{-10} \ll |\zeta(1+2iT)|^2 \Delta,$

since $N \ll T$, which is more than satisfactory for (4.11). The other main term is much more subtle, and gives

(4.23)  $MT_{OD} := \frac{|\zeta(1+2iT)|^2}{\zeta(2)} \frac{\Delta}{N} \sum_{h \neq 0} e(hx) \sigma_{-1}(h) \int_0^\infty \frac{w_2(y/N)\, w_2((y+h)/N)}{(y/N)^{1/2} ((y+h)/N)^{1/2}} \widehat{w}_1\Big(\frac{\Delta}{2\pi} \log \frac{y+h}{y}\Big)\,dy.$

We will show here

(4.24)  $MT_{OD} \ll |\zeta(1+2iT)|^2 (\Delta \log T + N S_\Delta),$

where $S_\Delta$ is a function satisfying (4.3). It is quite easy to show with a trivial bound that $MT_{OD} \ll |\zeta(1+2iT)|^2 N$, so (4.24) amounts to a logarithmic savings with the sum over $\Delta$ in dyadic segments (there are of course $O(\log T)$ such values of $\Delta$ over which to sum). As a first simple approximation step, by taking Taylor expansions we have that
r “«> 2 (-|)w 2 ( 2 s i )_/ A t _fy + h' 


(4.25) 




w 2 { 


y 

y 12 


/ o (f) 1/2 (^) 1/2 

h 

' y 

and hence by a trivial estimate on the /i-surn, we have 

w 2 {y) 2 

e\t lx) u _ip L) I 

h^O 


^(-a> + 0( n(, + M)-""), 


- 100 \ 


(4.26) MTqd = ^e^a.^h) 


C(2) 


y 


Ah \ 


wi - 


2nNy 
N 


)dy 


+ o(C|((l + 2iT)| 2 lo g r). 


This error term is certainly 0(NT 
(4.27) MT od < |C(1 + 2eT)| 2 A 


-5\ 


since 
y \2 


A t 25 / 27+£ . By changing variables, we have 


Ml) 


y 


A 


— ^a-i(h)e(ha;)u7i(- 


0 


Ah, 

'AT? 


dy + 0(NT 


—S\ 


We need to control the sum over $h$, on average over $\Delta$ running over dyadic segments. We pause our estimations of $MT_{OD}$ to state the following.

Lemma 4.3. Let $w_1$ be a smooth function supported on the positive reals, and for $x \in \mathbb{R}$ and $H \gg 1$, define

(4.28)  $Q(x,H) = H^{-1} \sum_{h \neq 0} \sigma_{-1}(h) e(hx) \widehat{w}_1(h/H).$

Then

(4.29)  $\sum_{\substack{1 \ll H < \infty \\ H \text{ dyadic}}} |Q(x,H)| \ll 1,$

where the implied constant depends only on $w_1$ and is uniform in $x$.

Lemma 4.3 suffices to show (4.24), as $H = y N / \Delta$ and $\Delta$ runs over dyadic segments. This means that $H$ can be partitioned into two classes of dyadic segments (taking even powers and odd powers separately, for example).

Having accounted for all the terms, the proof of Proposition 4.1 is complete. □


Proof of Lemma 4.3. We open $\sigma_{-1}(h)$ as a convolution, obtaining

(4.30)  $Q(x,H) = H^{-1} \sum_{a \geq 1} a^{-1} \sum_{b \neq 0} e(abx) \widehat{w}_1\Big(\frac{ab}{H}\Big).$
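The step (4.30) only uses $\sigma_{-1}(h) = \sum_{a \mid h} a^{-1}$ and the substitution $h = ab$. A minimal numerical sketch of that rearrangement follows; it is an illustration, not the paper's computation. As a simplifying assumption it replaces the smooth weight $\widehat{w}_1(h/H)$ by a sharp cutoff $1 \leq h \leq X$ and keeps only positive $h$, and all function names are hypothetical.

```python
import cmath
import math


def e(z: float) -> complex:
    """The additive character e(z) = exp(2*pi*i*z)."""
    return cmath.exp(2j * math.pi * z)


def sigma_minus_one(h: int) -> float:
    """sigma_{-1}(h): the sum of 1/a over the positive divisors a of h."""
    return sum(1.0 / a for a in range(1, h + 1) if h % a == 0)


def divisor_side(x: float, X: int) -> complex:
    """Sum over 1 <= h <= X of sigma_{-1}(h) e(hx)."""
    return sum(sigma_minus_one(h) * e(h * x) for h in range(1, X + 1))


def convolution_side(x: float, X: int) -> complex:
    """Same sum after writing h = a*b: sum over a of (1/a) * sum over b <= X/a of e(abx)."""
    total = 0j
    for a in range(1, X + 1):
        inner = sum(e(a * b * x) for b in range(1, X // a + 1))
        total += inner / a
    return total
```

Once the sum is in the second form, the inner sum over $b$ is a plain exponential sum for each fixed $a$, which is what makes Poisson summation available for small $a$.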

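For larger values of $a$, say $a > K \geq 1$, we estimate trivially, obtaining

(4.31)  $H^{-1} \sum_{a > K} a^{-1} \sum_{b \neq 0} e(abx) \widehat{w}_1\Big(\frac{ab}{H}\Big) \ll \frac{1}{K}.$

For $a \leq K$, we apply Poisson summation, getting

(4.32)  $H^{-1} \sum_{a \leq K} a^{-1} \sum_{b \neq 0} e(abx) \widehat{w}_1\Big(\frac{ab}{H}\Big) = \sum_{a \leq K} a^{-1} \Big( -\frac{\widehat{w}_1(0)}{H} + \frac{1}{a} \sum_{\nu \in \mathbb{Z}} w_1\Big(H\Big(x - \frac{\nu}{a}\Big)\Big) \Big).$

Thus

(4.33)  $Q(x,H) = \sum_{a \leq K} a^{-2} \sum_{\nu \in \mathbb{Z}} w_1\Big(H\Big(x - \frac{\nu}{a}\Big)\Big) + O\Big(K^{-1} + \frac{\log K}{H}\Big).$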

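Suppose that $w_1$ is supported on $[\alpha, \beta]$ where $0 < \alpha < \beta$. If we assume that $K \leq \varepsilon H$ for $\varepsilon^{-1} > 2\beta$, then the sum over $\nu$ will capture at most one value of $\nu$, which must be the integer nearest $ax$. More precisely, we have that the sum over $\nu$ vanishes unless $\alpha \leq \|ax\| \frac{H}{a} \leq \beta$. For each $a$, there are at most $1 + \log_2(\beta/\alpha)$ dyadic values of $H$ such that the sum over $\nu$ does not vanish. Thus

(4.34)  $\sum_{\substack{H \gg \varepsilon^{-1} \\ H \text{ dyadic}}} \sum_{\nu \in \mathbb{Z}} \Big| w_1\Big(H\Big(x - \frac{\nu}{a}\Big)\Big) \Big| \ll 1.$

We set $K \asymp \varepsilon H$, and by reversal of order of summation combined with (4.34), we derive

(4.35)  $\sum_{\substack{H \gg \varepsilon^{-1} \\ \text{dyadic}}} \sum_{a \ll \varepsilon H} a^{-2} \sum_{\nu \in \mathbb{Z}} \Big| w_1\Big(H\Big(x - \frac{\nu}{a}\Big)\Big) \Big| \ll \sum_{a=1}^{\infty} a^{-2} \ll 1.$

Similarly, for $H \ll \varepsilon^{-1}$ we use only the trivial bound $Q(x,H) \ll 1$, which needs to be used $O_\varepsilon(1)$ times. Thus we obtain that (4.29) holds. □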


5. The small $\Delta$ regime

5.1. Statement of results. One goal of this section is to collect some results which together prove Theorem 1.5. Much of our work here is also crucial in the proof of Theorem 1.2. Recall $I(\Delta, T, x, N)$ is defined by (4.6). One of the main results of this section is

Proposition 5.1. Suppose $T^{\varepsilon} \ll \Delta \ll T^{1/2-\varepsilon}$, for some $\varepsilon > 0$. Then

(5.1)  $I(\Delta, T, x, N) \ll N |\zeta(1+2iT)|^2 ((\log T)^{-10} + S_\Delta),$

where $S_\Delta$ has the properties described in Proposition 4.1.

Proposition 5.1 then implies Theorem 1.5 via Lemma 4.2.

It is of great interest to extend the range of $\Delta$ in Proposition 5.1, especially as large as $T^{25/27+\varepsilon}$, since that would prove Conjecture 1.3. The methods in this section rely on opening the divisor function $\tau_{iT}(n) = \sum_{ab=n} (a/b)^{iT}$ and do not rely on the spectral theory of automorphic forms but rather abelian harmonic analysis. The relative sizes of $a$ and $b$ play a major role in our estimations, and so we wish to localize $a$ and $b$ at this early stage. To this end, define


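(5.2)  $I_{A,B} = \int_{-\infty}^{\infty} w\Big(\frac{t}{\Delta}\Big) \Big| \sum_{a,b} \frac{e(abx)}{a^{1/2-it}\, b^{1/2+2iT-it}} f\Big(\frac{a}{A}\Big) g\Big(\frac{b}{B}\Big) \Big|^2 dt,$

where $A, B \gg 1$, and $w, f, g$ are fixed smooth compactly-supported weight functions. For small values of $b$, it is better to not use a dyadic partition. Instead, let $B_0 \asymp T^{\alpha}$ for some small $0 < \alpha < 1/4$, and let $g_0$ be a smooth function such that $g_0(y) = 1$ for $y < 1$, and $g_0(y) = 0$ for $y > 2$. Then set

(5.3)  $I_{B_0} = \int_{-\infty}^{\infty} w\Big(\frac{t}{\Delta}\Big) \Big| \sum_{a,b} \frac{e(abx)}{a^{1/2-it}\, b^{1/2+2iT-it}} g_0\Big(\frac{b}{B_0}\Big) w_2\Big(\frac{ab}{N}\Big) \Big|^2 dt.$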
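Since everything in this section starts from the formula $\tau_{iT}(n) = \sum_{ab=n} (a/b)^{iT}$, it may help to see it computed directly. The sketch below is only an illustration with hypothetical function names; it evaluates the sum over factorizations and lets one confirm numerically three standard facts: $\tau_{iT}(n)$ is real (the terms for $(a,b)$ and $(b,a)$ are complex conjugates), it is bounded in modulus by the number of divisors of $n$, and it is multiplicative.

```python
def tau(n: int, T: float) -> complex:
    """tau_{iT}(n): the sum of (a/b)^{iT} over factorizations n = a*b."""
    total = 0j
    for a in range(1, n + 1):
        if n % a == 0:
            b = n // a
            total += (a / b) ** (1j * T)
    return total


def num_divisors(n: int) -> int:
    """d(n), the trivial upper bound for |tau_{iT}(n)|."""
    return sum(1 for a in range(1, n + 1) if n % a == 0)
```

Localizing $a \asymp A$ and $b \asymp B$ as in (5.2) destroys the multiplicativity, which is one reason only the trivial bound by a restricted divisor count is available in Lemma 5.3.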
The values of $b$ near $T^{1/2}$ are also somewhat exceptional, and it is helpful to employ a configuration taking this into account. To this end, define

T 1 / 2 

< 5 ' 4 ) yr®. Bt = T^iogrr. 

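We elucidate a connection between $I(\Delta, T, x, N)$, $I_{A,B}$, and $I_{B_0}$, with the following

Lemma 5.2. With certain $w, f, g, g_0, B_0$ as above, with $B_{\pm}$ as in (5.4), and with $AB \asymp N = \sqrt{\Delta T}$, we have

(5.5)  $I(\Delta, T, x, N) \ll I_{B_0} + \log\log T \sum_{\substack{B_- \ll B \ll B_+ \\ \text{dyadic}}} I_{A,B} + \log T \Big( \sum_{\substack{B_0 \ll B \ll B_- \\ \text{dyadic}}} I_{A,B} + \sum_{\substack{B \gg B_+ \\ \text{dyadic}}} I_{A,B} \Big) + \Delta^{-100} N \log T.$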


Proof of Lemma 5.2. In the definition of $I(\Delta, T, x, N)$ (that is, (4.6)), write $\tau_{iT}(n) = \sum_{ab=n} (a/b)^{iT}$ and then apply a partition of unity to the $b$-sum. In this way, we have

(5.6)  $I(\Delta, T, x, N) = \int_{-\infty}^{\infty} w\Big(\frac{t}{\Delta}\Big) \Big| \sum_j \sum_{a,b} \frac{e(abx)\, g_j(b)}{a^{1/2-it}\, b^{1/2+2iT-it}} w_2\Big(\frac{ab}{N}\Big) \Big|^2 dt,$

where $g_j$ are elements of the partition. We do not want to immediately apply Cauchy's inequality to take the sum over $j$ to the outside, because this loses a factor of $\log T$ which is fatal. However, in some ranges of the parameters, we save a large power of $\log T$ that is able to compensate for this, so we are free to lose $\log T$ to simplify the analysis. This discussion helps motivate the following decomposition.

First we choose $g_0(b/B_0)$ to be one element of the partition of unity in the $b$-sum. Next we simply apply Cauchy's inequality to separate this part of the partition from all the other terms. This explains the presence of $I_{B_0}$ in (5.5): the main point here is we did not lose the factor $\log T$.

We may assume that $g_0$ is such that $1 - g_0(y/B_0)$ equals a dyadic partition of unity (meaning, each $g_j(b) = g(b/B)$, say with $b \asymp B$, where $AB = N$, and $B$ runs over a dyadic sequence), since we could start with a dyadic partition of unity of this type, and then write $g_0$ as a finite sum of elements of this dyadic partition. Having located $b \asymp B$ with $g(b/B)$, $a$ is automatically localized by $a \asymp A$ by the support of $w_2$. We are then free to multiply by $f(a/A)$ where $f$ is smooth of compact support, and such that $f(a/A) = 1$ for all $a$ in the support of $g(b/B) w_2(ab/N)$. In all, this shows that we may write

(5.7)  $I(\Delta, T, x, N) \ll I_{B_0} + \int_{-\infty}^{\infty} w\Big(\frac{t}{\Delta}\Big) \Big| \sum_{B \text{ dyadic}} \sum_{a,b} \frac{e(abx)\, f(a/A)\, g(b/B)}{a^{1/2-it}\, b^{1/2+2iT-it}} w_2\Big(\frac{ab}{N}\Big) \Big|^2 dt.$
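The compatibility between $g_0$ and a dyadic partition of unity used above is a telescoping identity: if $g(y) = g_0(y) - g_0(2y)$, then $g$ is supported on $[1/2, 2]$ and $\sum_{j=1}^{J} g(b/(2^j B_0)) = g_0(b/(2^J B_0)) - g_0(b/B_0)$, which equals $1 - g_0(b/B_0)$ once $2^J B_0 \geq b$. The sketch below checks this numerically; the particular smooth step chosen for $g_0$ and all function names are assumptions made for illustration, since the paper only requires $g_0 = 1$ on $y < 1$ and $g_0 = 0$ on $y > 2$.

```python
import math


def g0(y: float) -> float:
    """A smooth step: equals 1 for y <= 1 and 0 for y >= 2 (one possible choice)."""
    if y <= 1.0:
        return 1.0
    if y >= 2.0:
        return 0.0
    s = y - 1.0  # s lies strictly between 0 and 1 here

    def bump(u: float) -> float:
        return math.exp(-1.0 / u)

    return bump(1.0 - s) / (bump(1.0 - s) + bump(s))


def g(y: float) -> float:
    """Dyadic piece g(y) = g0(y) - g0(2y), supported on 1/2 <= y <= 2."""
    return g0(y) - g0(2.0 * y)


def dyadic_sum(b: float, B0: float, J: int) -> float:
    """Sum of g(b / B) over the dyadic values B = 2^j * B0, 1 <= j <= J."""
    return sum(g(b / (2.0 ** j * B0)) for j in range(1, J + 1))
```

For each fixed $b$ at most two consecutive dyadic pieces are nonzero, which is why localizing $b \asymp B$ costs nothing until Cauchy's inequality is applied across the pieces.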


Next we separate the sum over $B$ into two more blocks, those with $B_- \ll B \ll B_+$ (with $B_{\pm}$ as in (5.4)), and the complement. There are $\ll \log\log T$ dyadic values of $B$ in the first block, and $\ll \log T$ in the second block. Applying Cauchy's inequality first to separate the two blocks, and then again within each block, shows an estimate almost of the form (5.5), the only difference being that instead of $I_{A,B}$, we have an expression of the form

(5.8)  $\int_{-\infty}^{\infty} w\Big(\frac{t}{\Delta}\Big) \Big| \sum_{a,b} \frac{e(abx)\, f(a/A)\, g(b/B)}{a^{1/2-it}\, b^{1/2+2iT-it}} w_2\Big(\frac{ab}{N}\Big) \Big|^2 dt.$

That is, we have a weight function of the form $f(a/A) g(b/B) w_2(ab/N)$, and we would like to omit $w_2$ in order to separate the variables further. Using a method very similar to that in the proof of Lemma 4.2, we can do this, at the cost of an error term of size $\Delta^{-100} N \log T$. □

In the application to $I(\Delta, T, x, N)$ with $\Delta \gg T^{\varepsilon}$, we have $AB = N = (\Delta T)^{1/2}$. However, for some technical reasons, when $\Delta \ll T^{\varepsilon}$, we need a minor generalization of the range of $A, B$. For this reason, in the rest of this section we let

(5.9)  $AB = M$, where $T^{1/2-\varepsilon} \ll M \ll T^{1-\varepsilon}$.

Before developing more intricate bounds on $I_{A,B}$, we record a "trivial" bound:

Lemma 5.3. We have

(5.10)  $I_{A,B} \ll (\Delta + M) \log T.$

Proof. By the mean value theorem for Dirichlet polynomials (Lemma 2.2), we have

(5.11)  $I_{A,B} \ll \Big(\frac{\Delta}{M} + 1\Big) \sum_{n \asymp M} |\tau_{iT}(n; A, B)|^2,$

where we define

(5.12)  $\tau_{iT}(n; A, B) = \sum_{ab=n} (a/b)^{iT} f(a/A) g(b/B).$

If we had not localized the variables $a \asymp A$, $b \asymp B$, $\tau_{iT}(n; A, B)$ would simply be $\tau_{iT}(n) w_2(n/N)$, and Lemma 2.4 would give a bound of the desired shape (multiplied by $|\zeta(1+2iT)|^2$). Instead, we may use the trivial bound $|\tau_{iT}(n; A, B)| \leq \tau_0(n; A, B)$, and by elementary considerations,

(5.13)  $\sum_{n \ll N} |\tau_0(n; A, B)|^2 \ll \sum_{b_1, b_2 \asymp B} \sum_{\substack{n \ll N \\ n \equiv 0 \ (\mathrm{mod}\ [b_1, b_2])}} 1.$

To complete the proof, we calculate easily that

(5.14)  $\sum_{b_1, b_2 \asymp B} \frac{1}{[b_1, b_2]} \ll \sum_{d \ll B} d^{-1} \sum_{b_1, b_2 \asymp B/d} \frac{1}{b_1 b_2} \ll \log B.$


□ 
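The mean value theorem for Dirichlet polynomials invoked in this proof says, in its classical Montgomery-Vaughan form, that $\int_0^{L} |\sum_{n \leq N} c_n n^{-it}|^2 dt = \sum_{n \leq N} |c_n|^2 (L + O(n))$. The sketch below is an illustration only (hypothetical function names, sharp cutoffs in place of the smooth weight $w(t/\Delta)$): it evaluates the integral exactly by integrating each cross term in closed form, and compares it with the diagonal term. The explicit constant $3\pi$ in the error bound is the one that follows from the Montgomery-Vaughan generalized Hilbert inequality, stated here as background rather than taken from the paper.

```python
import cmath
import math


def dirichlet_mean_square(coeffs, length):
    """Exact value of the integral over 0 <= t <= length of |sum_n c_n n^{-it}|^2.

    coeffs[k] is the coefficient c_n for n = k + 1.
    """
    N = len(coeffs)
    logs = [math.log(n) for n in range(1, N + 1)]
    total = 0.0
    for m in range(N):
        for n in range(N):
            if m == n:
                kernel = length
            else:
                d = logs[m] - logs[n]
                # integral of exp(-i t d) over 0 <= t <= length
                kernel = (cmath.exp(-1j * length * d) - 1.0) / (-1j * d)
            total += (coeffs[m] * kernel * coeffs[n].conjugate()).real
    return total


def diagonal_term(coeffs, length):
    """The expected main term: length times the sum of |c_n|^2."""
    return length * sum(abs(c) ** 2 for c in coeffs)


def error_bound(coeffs):
    """3*pi times the sum of (n + 1)|c_n|^2, bounding the off-diagonal part."""
    return 3.0 * math.pi * sum((k + 2) * abs(c) ** 2 for k, c in enumerate(coeffs))
```

In the setting of Lemma 5.3 the integral has length about $\Delta$ and the polynomial has length about $M$, which is where the shape $\Delta + M$ in (5.10) comes from.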


The rest of this section contains lemmas that show various improvements over Lemma 5.3. For ease of reference, we collect a few lemmas together, deferring the proofs until later in this section.

Lemma 5.4. Suppose that for some $\varepsilon > 0$, we have

(5.15)  $\Delta T^{\varepsilon} \ll B \ll T^{1-\varepsilon},$

and $1 \ll \Delta \ll T^{1-\varepsilon}$. Then there exists $\delta > 0$ (depending on $\varepsilon$) so that

(5.16)  $I_{A,B} \ll M T^{-\delta}.$

The dependence of $\delta$ on $\varepsilon$ is not explicit because it relies on iterated uses of Van der Corput's A-Process.

Lemma 5.5. Suppose $1 \ll \Delta \ll T^{1-\varepsilon}$, and $B \ll T^{1-\varepsilon}$. Then

(5.17)  $I_{A,B} \ll \Big(\Delta + \frac{TM}{B^2}\Big) \log\log T.$

This bound is $O(M T^{-\delta})$ for $B \gg T^{1/2+\delta}$ and $\Delta \ll M T^{-\delta}$.

Lemma 5.6. Suppose $(\log T)^{100} \ll \Delta \ll T^{1-\varepsilon}$ and $B \gg T^{\varepsilon}$. Then for some $\delta > 0$ (depending on $\varepsilon$), we have

(5.18)  $I_{A,B} \ll M |\zeta(1+2iT)|^2 (\log T)^{-50} + \frac{\Delta B^2}{M} \log\log T + \Delta \log\log T.$

Finally we require a result valid for $B \ll T^{\varepsilon}$.

Lemma 5.7. Suppose that $B_0 = T^{\alpha}$, for some fixed $0 < \alpha < 1/4$, and $(\log T)^{100} \ll \Delta \ll T^{1-\varepsilon}$. Then with $S_\Delta$ satisfying the same properties as in Proposition 4.1, we have

(5.19)  $I_{B_0} \ll M |\zeta(1+2iT)|^2 ((\log T)^{-50} + S_\Delta) + \Delta \log T.$

Remark. In truth, the range of applicability of the method of proof of Lemma 5.7 is for values of $b$ at most slightly larger than $a$.

Before proceeding to the proofs of these lemmas, we describe how they combine to prove Proposition 5.1. We need to bound all the terms on the right hand side of (5.5). Lemma 5.7 treats $I_{B_0}$ with an acceptable bound. Next we analyze $I_{A,B}$. In case $B \ll T^{1/2-\varepsilon}$, we apply Lemma 5.6. In case $B \gg T^{1/2-\varepsilon}$, we may apply Lemma 5.4 since we are assuming $\Delta \ll T^{1/2-\varepsilon'}$ for some $\varepsilon' > 0$. We have therefore covered all the ranges of $B$, showing Proposition 5.1.

The astute reader may notice that we did not require Lemma 5.5. Nevertheless, it is conceptually satisfying, since when combined with Lemmas 5.6 and 5.7 it shows that we can handle all values of $B$ satisfactorily except for $B = T^{1/2} (\log T)^{O(1)}$, and hence $A = \Delta^{1/2} (\log T)^{O(1)}$. The key point that allows us to treat all values of $B$, for $T^{\varepsilon} \ll \Delta \ll T^{1/2-\varepsilon}$, is that Lemma 5.4 is applicable in the range $B \gg \Delta T^{\varepsilon}$, which overlaps the otherwise problematic region $B = T^{1/2+o(1)}$, at least for $\Delta \ll T^{1/2-\varepsilon}$. Anyway, we shall use Lemma 5.5 in the proof of (HU).

Using these bounds, we may also conclude

Corollary 5.8. Suppose $T^{\varepsilon} \ll \Delta \ll T^{1-\varepsilon}$. Then

(5.20) I(A,T,x,N) « + | C (1 + 2iT)\ 2 S A + (loglogT) 2 ). 


Remark. The first term in (5.20) may be dropped by comparison to the third term with $(\log\log T)^2$.

For this, we use (5.5). Lemma 5.7 treats $I_{B_0}$, as in the proof of Proposition 5.1, leading to the first two terms in (5.20). For the terms with $I_{A,B}$ we take the minimum of the bounds stated in Lemmas 5.5 and 5.6. Thus we obtain

(5.21)  $\sum_{\substack{B_- \ll B \ll B_+ \\ \text{dyadic}}} I_{A,B} \ll N \frac{|\zeta(1+2iT)|^2}{(\log T)^{50}} + N \log\log T \sum_{\substack{B \gg 1 \\ \text{dyadic}}} \min\Big( \frac{\Delta B^2}{N^2}, \frac{T}{B^2} \Big).$

Recall that $\frac{\Delta}{N^2} = \frac{1}{T}$, and so the dyadic sums satisfy $\sum_{B \ll T^{1/2}} \frac{B^2}{T} \ll 1$ and $\sum_{B \gg T^{1/2}} \frac{T}{B^2} \ll 1$. Therefore, the bound on this range of $B$'s is acceptable for Corollary 5.8. For the other values of $B$ with $B \ll B_-$ or $B \gg B_+$, we lose a factor $\log T$ (the one directly appearing in (5.5)), but we gain a large power of $\log T$ back since $B \ll B_-$ or $B \gg B_+$, so the bound on these terms is already included on the right hand side of (5.20).

From Corollary 5.8, by summing over the $O(\log T)$ values of $\Delta$, we may conclude
Corollary 5.9. We have

(5.22)  $\int_{T^{\varepsilon} \ll |t \mp T| \ll T^{1-\varepsilon}} |F_0(it)|^2\,dt \ll 1 + \frac{(\log T)(\log\log T)^2}{|\zeta(1+2iT)|^2}.$

We may also reap a small improvement over Lemma 5.3.

Corollary 5.10. Suppose $N \asymp AB \asymp (\Delta T)^{1/2}$. Then

(5.23)  $I_{A,B} \ll (\Delta + N) \log\log T.$

Proof. This follows by combining Lemmas 5.5 and 5.6 again, and noting that $N |\zeta(1+2iT)|^2 (\log T)^{-50}$ is negligible compared to the displayed bound. □

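5.2. Proof of Lemma 5.4. By Cauchy's inequality applied to the inner $a$-sum, we derive

(5.24)  $I_{A,B} \ll \int_{-\infty}^{\infty} w\Big(\frac{t}{\Delta}\Big) \sum_a f(a/A) \Big| \sum_b \frac{e(abx)}{b^{1/2+2iT-it}} g\Big(\frac{b}{B}\Big) \Big|^2 dt.$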

Then we square out and execute the integral, getting 

{h 2 /h l ) 2lT jb x \ rh 


e(abx) 
Ip/'i+'iiT-ithy-ff 

( A 


dt. 


(5.25) I AtB « A 


61,62 a 


For the diagonal $b_1 = b_2$, there is no possible cancellation, and we get a bound of size $O(\Delta A)$, which is $\ll AB T^{-\varepsilon}$, by (5.15).

We need to bound the off-diagonal terms also. For $b_1 \neq b_2$, we change variables $b_1 = b$, $b_2 = b + h$ to obtain

(5.26)  $I_{A,B} \ll \Delta A + \frac{\Delta}{B} \sum_{h \neq 0} \sum_a f(a/A) e(-axh) \sum_b \Big(\frac{b+h}{b}\Big)^{-2iT} W(h,b),$

with

(5.27)  $W(h,b) = \frac{g(b/B)\, g((b+h)/B)}{(b/B)^{1/2} ((b+h)/B)^{1/2}} \widehat{w}\Big(\frac{\Delta}{2\pi} \log \frac{b+h}{b}\Big).$

Note that the trivial bound applied to the off-diagonal terms gives $I_{A,B} \ll \Delta A + AB$, so we need to show there is some cancellation. It is easy to see that

(5.28)  $\frac{\partial^j}{\partial y^j} W(h,y) \ll B^{-j} \Big(1 + \frac{|h| \Delta}{B}\Big)^{-C},$

for arbitrary $C > 0$. More importantly, the phase function $\varphi(y) = -2T \log((y+h)/y)$ satisfies

(5.29)  $\Big| \frac{d^j}{dy^j} \varphi(y) \Big| \asymp B^{-j} F$, where $F = \frac{T |h|}{B}.$

Thus by [GK, Theorem 2.9] (alternatively, see [IK, Theorem 8.4] for a bound with more stringent hypotheses on $F$) we have that

(5.30)  $\sum_b \Big(\frac{b+h}{b}\Big)^{-2iT} W(h,b) \ll_q \Big(\frac{F}{B^{q+2}}\Big)^{\frac{1}{4Q-2}} B + \frac{B}{F},$

where $q \in \{0, 1, 2, \dots\}$ is at our disposal, and $Q = 2^q$. Therefore, as long as $B^{\eta} \ll F \ll B^{k}$ for some $\eta > 0$ and $k \in \mathbb{N}$, we may claim a power saving in the $b$-sum (this is initially given as a power saving in $B$, but since $B \gg T^{\varepsilon}$, it is a power saving in $T$ also). Note that $F \gg T^{\varepsilon}$, so the lower bound on $F$ is satisfied, and also we may assume $|h| \ll \frac{B}{\Delta} T^{\varepsilon}$, by (5.28), in which case $F \ll \frac{T}{\Delta} T^{\varepsilon} \ll T^{1+\varepsilon}$. In particular, we have $F \ll B^{k}$ for some $k$ depending on $\varepsilon > 0$. Therefore, we may claim the bound (5.16).


5.3. Proof of Lemma 5.5. We may certainly assume $B \gg T^{1/2} (\log T)^{-50}$, since otherwise the bound is trivial. The basic idea is that Poisson summation in $b$ inside the absolute square gives a savings by reducing the length of summation (provided $B$ is slightly larger than $T^{1/2}$). Furthermore, this process is compatible with the cancellation from the $t$-integration.

We have, with $G(u) = u^{-1/2} g(u)$, that

(5.31)  $\sum_b \frac{e(abx)}{b^{1/2+2iT-it}} g\Big(\frac{b}{B}\Big) = B^{-1/2} \sum_{l \in \mathbb{Z}} \int_{-\infty}^{\infty} e\Big(axu + lu - \frac{2T - t}{2\pi} \log u\Big) G(u/B)\,du.$

Write the phase here as $\phi(u) = (ax + l) u - \frac{2T - t}{2\pi} \log u$, which satisfies

(5.32)  $\phi'(u) = (ax + l) - \frac{2T - t}{2\pi u}.$

We first claim that the integral is small unless there is a stationary point. The second term in the derivative of the phase is $\asymp T/B$, and repeated integration by parts (see [BKY, Lemma 8.1]) shows that the integral is small (e.g., $O(T^{-100})$) unless $(ax + l) \asymp T/B$. By periodicity, we may as well assume $|x| \leq 1/2$, in which case $a|x| \leq a \ll \frac{M}{B} = o(T/B)$ (recall (5.9)), and therefore this means $l \asymp T/B$. We think of $l + ax$ as a perturbation of $l$. The stationary point occurs at $2\pi u_0 = \frac{2T - t}{l + ax}$. By [BKY, Proposition 8.2], we get an asymptotic expansion for the $b$-sum in the form

(5.33)  $\sum_b \frac{e(abx)}{b^{1/2+2iT-it}} g\Big(\frac{b}{B}\Big) = \Big(\frac{2T - t}{2\pi e}\Big)^{-i(2T-t)} \sum_l \frac{1}{(l + ax)^{1/2-2iT+it}} P\Big(\frac{2T - t}{B(l + ax)}\Big) + O(T^{-100}),$

where $P$ is a function supported on $v \asymp 1$, satisfying $P^{(j)}(v) \ll 1$. The phase here takes the form

(5.34)  $\frac{e^{i\varphi(t,T)}}{(l + ax)^{it-2iT}},$

for some function $\varphi$ independent of $a$ and $l$. Altogether, this shows


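(5.35)  $I_{A,B} = \int_{-\infty}^{\infty} w\Big(\frac{t}{\Delta}\Big) \Big| \sum_{a,l} \frac{P\big(\frac{2T - t}{B(l + ax)}\big) f\big(\frac{a}{A}\big)}{a^{1/2-it} (l + ax)^{1/2-2iT+it}} \Big|^2 dt + O(T^{-50}).$

Since $\frac{|t|}{T} \ll \frac{\Delta}{T} \ll T^{-\varepsilon}$, and $\frac{a|x|}{l} \ll T^{-\varepsilon}$, we can take Taylor expansions to obtain

(5.36)  $P\Big(\frac{2T - t}{B(l + ax)}\Big) = P\Big(\frac{2T}{Bl}\Big) + \sum_{j,k} P_{j,k}\Big(\frac{2T}{Bl}\Big) \Big(\frac{t}{T}\Big)^j \Big(\frac{ax}{l}\Big)^k + O(T^{-100}),$

where $P_{j,k}$ is a finite linear combination of derivatives of $P$, with $j + k \geq 1$, and the total number of terms in the sum bounded by a function of $\varepsilon$. By a use of Cauchy's inequality, we then have

(5.37)  $I_{A,B} \ll \int_{-\infty}^{\infty} w\Big(\frac{t}{\Delta}\Big) \Big| \sum_{a,l} \frac{P\big(\frac{2T}{Bl}\big) f\big(\frac{a}{A}\big)}{a^{1/2-it} (l + ax)^{1/2-2iT+it}} \Big|^2 dt + O(T^{-50}),$

where by abuse of notation, $P$ is meant to represent $P$ or one of the $P_{j,k}$ (of course the case of $P$ is the hardest for us to estimate, as the terms with $j + k \geq 1$ will be smaller than this by a factor $\ll T^{-\varepsilon}$, so the notation is not that abusive after all).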

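Change variables $a \to da$ and $l \to dl$ where now $(a,l) = 1$. Define the coefficients

(5.38)  $\gamma_{a,l} = \sum_{d=1}^{\infty} \frac{P\big(\frac{2T}{Bdl}\big) f\big(\frac{ad}{A}\big)}{d^{1-2iT}},$

so that

(5.39)  $I_{A,B} \ll \int_{-\infty}^{\infty} w\Big(\frac{t}{\Delta}\Big) \Big| \sum_{(a,l)=1} \frac{\gamma_{a,l}}{a^{1/2-it} (l + ax)^{1/2-2iT+it}} \Big|^2 dt + O(T^{-50}).$

Note that $\gamma_{a,l} = 0$ unless $a \asymp \frac{M}{T} l$. Of course we have the trivial bound $|\gamma_{a,l}| \ll 1$, but in many ranges this can be improved:

(5.40)  $|\gamma_{a,l}| \ll (\log T)^{-50}$, if $l \ll \frac{T}{B} (\log T)^{-2/3-\varepsilon}$.

This follows from the Vinogradov-Korobov bound (Lemma 2.1). Consequently, we have

(5.41)  $\sum_{(a,l)=1} \frac{|\gamma_{a,l}|^2}{al} \ll \log\log T.$

Similarly, if we define $\gamma_X = \max_{l \asymp X} |\gamma_{a,l}|$ (with no restriction on $a$), then $\sum_{X \text{ dyadic}} \gamma_X \ll \log\log T$.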

Now we continue with the estimation of $I_{A,B}$. By opening the square and performing the $t$-integral, we obtain with a trivial bound that

(5.42)  $I_{A,B} \ll \Delta \sum_{\substack{(a_1,l_1)=1 \\ (a_2,l_2)=1}} \frac{|\gamma_{a_1,l_1} \gamma_{a_2,l_2}|}{(a_1 a_2 l_1 l_2)^{1/2}} \Big| \widehat{w}\Big(\frac{\Delta}{2\pi} \log \frac{a_1 (l_2 + a_2 x)}{a_2 (l_1 + a_1 x)}\Big) \Big| + O(T^{-50}).$

Here we used that $l + ax \asymp l$. Note that $a_1 (l_2 + a_2 x) - a_2 (l_1 + a_1 x) = a_1 l_2 - a_2 l_1$. Thus we obtain

(5.43)  $I_{A,B} \ll \Delta \sum_{\substack{(a_1,l_1)=1 \\ (a_2,l_2)=1}} \frac{|\gamma_{a_1,l_1} \gamma_{a_2,l_2}|}{(a_1 a_2 l_1 l_2)^{1/2}} \Big(1 + \frac{\Delta |a_1 l_2 - a_2 l_1|}{a_2 l_1}\Big)^{-100} + O(T^{-50}).$

The diagonal contribution, that is, the solutions with $a_1 l_2 = a_2 l_1$, in fact have $a_1 = a_2$ and $l_1 = l_2$, and so contribute at most

(5.44)  $\ll \Delta \sum_{(a,l)=1} \frac{|\gamma_{a,l}|^2}{al} \ll \Delta \log\log T,$

by (5.41). Next we estimate the off-diagonal contributions. For simplicity, let us localize the variables dyadically, say $l_1 \asymp X_1$ and $l_2 \asymp X_2$, where now $X_1$ and $X_2$ range over a dyadic set of numbers $\ll T/B$. To ease the exposition, let us also only count the solutions with $|a_1 l_2 - a_2 l_1| \ll \frac{a_2 l_1}{\Delta}$; the larger values do not lead to a larger bound, because of the rapid decay of the weight function. Observe that $\big|\frac{a_1}{l_1} - \frac{a_2}{l_2}\big| \gg (X_1 X_2)^{-1}$. Therefore, given coprime $a_1, l_1$ with $l_1 \asymp X_1$, the number of coprime $a_2, l_2$ with $l_2 \asymp X_2$, and satisfying $\big|\frac{a_1}{l_1} - \frac{a_2}{l_2}\big| \ll \frac{M}{T\Delta}$, is at most $X_1 X_2 \frac{M}{T\Delta}$ (since their spacing is at least $(X_1 X_2)^{-1}$).
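The spacing statement used in this count is the elementary fact that two distinct reduced fractions satisfy $|a_1/l_1 - a_2/l_2| = |a_1 l_2 - a_2 l_1|/(l_1 l_2) \geq 1/(l_1 l_2)$, because the numerator is a nonzero integer. The following sketch verifies this with exact rational arithmetic on a small range; the ranges and function names are arbitrary choices made for illustration.

```python
from fractions import Fraction
from math import gcd


def reduced_fractions(max_numerator, min_denominator, max_denominator):
    """All pairs (a, l) with gcd(a, l) = 1 in the given ranges."""
    return [
        (a, l)
        for l in range(min_denominator, max_denominator + 1)
        for a in range(1, max_numerator + 1)
        if gcd(a, l) == 1
    ]


def min_cross_difference(pairs_1, pairs_2):
    """Smallest nonzero value of |a1*l2 - a2*l1| over the two lists of pairs."""
    best = None
    for a1, l1 in pairs_1:
        for a2, l2 in pairs_2:
            cross = abs(a1 * l2 - a2 * l1)
            if cross != 0 and (best is None or cross < best):
                best = cross
    return best
```

The same observation explains the diagonal case: for reduced fractions, $a_1 l_2 = a_2 l_1$ forces $a_1 = a_2$ and $l_1 = l_2$.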

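Therefore, the contribution of the off-diagonal terms is

(5.45)  $\ll \Delta \sum_{\substack{X_1, X_2 \\ \text{dyadic}}} \gamma_{X_1} \gamma_{X_2} \frac{T}{M X_1 X_2} \cdot \frac{X_1^2 M}{T} \cdot X_1 X_2 \frac{M}{T\Delta} \ll \sum_{\substack{X_1, X_2 \\ \text{dyadic}}} \gamma_{X_1} \gamma_{X_2} \frac{X_1^2 M}{T}.$

We have $\sum_{X_1 \text{ dyadic}} \gamma_{X_1} X_1^2 \ll (T/B)^2$, and $\sum_{X_2 \text{ dyadic}} \gamma_{X_2} \ll \log\log T$. In all, the off-diagonal terms contribute to $I_{A,B}$ at most

(5.46)  $\ll \frac{TM}{B^2} \log\log T.$

Combining (5.44) and (5.46) completes the proof.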


5.4. Proof of Lemma 5.6. The initial steps of the proof are quite similar to that of Lemma 5.5, except that we apply Poisson summation in $a$ instead of $b$. The later steps are different because we encounter a much more difficult counting argument in this problem. We may assume $B \ll \Delta^{-1/2} M (\log T)^{100}$, equivalently, $A \gg \Delta^{1/2} (\log T)^{-100}$, since otherwise the bound is trivial. We have

(5.47)  $\sum_{a=1}^{\infty} \frac{e(abx)}{a^{1/2-it}} f(a/A) = \sum_{q \in \mathbb{Z}} \int_0^\infty e\Big((bx - q) u + \frac{t}{2\pi} \log u\Big) u^{-1/2} f(u/A)\,du.$

By [BKY, Lemma 8.1], the integral is $O(A \Delta^{-100})$ unless $|bx - q| \asymp \frac{\Delta}{A}$, that is,

(5.48)  $\Big| x - \frac{q}{b} \Big| \asymp \frac{\Delta}{M}.$

Assuming (5.48) holds, the integral can be asymptotically evaluated using stationary phase. The conditions of [BKY, Proposition 8.2] are not quite met if $\Delta$ is small compared to $A$. Instead, we may quote [Hu, Lemma 5.5.6] (the reader may also try to derive a weighted version of [m Corollary 8.15]). In this way, we have


(5.49) 


E 

a= 1 


e(abx) 

a i/ 2 — 


f{<*/ A ) 


(t/e )“E 

q&I 


lMsL,n(Xl\ 

(q-bx) 1 /^ V A 3 / 2 / ’ 


where as in the proof of Lemma 5.5, $P(v)$ is a smooth function supported on $v \asymp 1$, satisfying $P^{(j)}(v) \ll 1$ (in fact, $P$ here is a constant multiple of $f$; obtaining a stronger error term requires altering the weight function slightly, but the quoted error term is strong enough for us here). By a use of Cauchy-Schwarz to separate the main term from the error term, and trivially integrating over $t$ in the error term, we derive


(5.50) 


Ia,b -C 



SWB)P(^) 

51/2+2 iT-it^q _ i> x y/2+it 


2 dt + 0(ABA~ 2 ). 


The error term is satisfactory. 

By analogy with the mean value theorem for Dirichlet polynomials, one might expect that 
Ia,b (A + )T £ , and note = A^ = M (and if M = N = VAT, this is N 

However, there are some differences so this guess is not entirely reliable, yet partially hints 
at the truth. 
Next we perform some simplifications, towards separation of variables. Write $(b, q) = d$, and set $b = d b_0$, $q = d q_0$ where now $(q_0, b_0) = 1$. In the summation, we enforce the condition (5.48) (of course $q/b = q_0/b_0$), which is redundant to the support of $P$. We then obtain
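(5.51) $$I_{A,B} \ll \int_{-\infty}^{\infty} w_1\Big(\frac{t}{\Delta}\Big) \Big| \sum_{d} \frac{1}{d^{1+2iT}} \sum_{\substack{(b_0,q_0)=1 \\ |x - \frac{q_0}{b_0}| \asymp \frac{\Delta}{M}}} \frac{g(db_0/B)\, P\big(\frac{t}{Ad(q_0-b_0x)}\big)}{b_0^{1/2+2iT-it} (q_0-b_0x)^{1/2+it}} \Big|^2 dt + O(AB\Delta^{-2}).$$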


By the same separation of variables method as in the proof of Lemma 4.2, we can remove the
weight function $P$ by a Mellin transform. Let us say that the integration variable attached
to $P$ is $v$. Changing variables $t \to t - v$, and letting $2T' = 2T + v$, we obtain, for some
smooth compactly-supported function $w_3$,

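(5.52) $$I_{A,B} \ll \max_{|T'-T| \ll \Delta^{\varepsilon}} \int_{-\infty}^{\infty} w_3\Big(\frac{t}{\Delta}\Big) \Big| \sum_{d} \frac{1}{d^{1+2iT'}} \sum_{\substack{(b_0,q_0)=1 \\ |x - \frac{q_0}{b_0}| \asymp \frac{\Delta}{M}}} \frac{g(db_0/B)}{b_0^{1/2+2iT'-it} (q_0-b_0x)^{1/2+it}} \Big|^2 dt + \frac{M}{\Delta^2}.$$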


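Define the following coefficients by

(5.53) $$\alpha_b = \sum_{d} \frac{g(db/B)}{d^{1+2iT'}},$$

so with this notation

(5.54) $$I_{A,B} \ll \max_{|T'-T| \ll \Delta^{\varepsilon}} \int_{-\infty}^{\infty} w_3\Big(\frac{t}{\Delta}\Big) \Big| \sum_{\substack{(b_0,q_0)=1 \\ |x - \frac{q_0}{b_0}| \asymp \frac{\Delta}{M}}} \frac{\alpha_{b_0}}{b_0^{1/2+2iT'-it} (q_0-b_0x)^{1/2+it}} \Big|^2 dt + \frac{M}{\Delta^2}.$$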


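We claim the following bounds on $\alpha_b$:

(5.55) $$\alpha_b \ll \begin{cases} (\log T)^{-100}, & \text{for } b \ll \frac{B}{(\log T)^{2/3+\varepsilon}}, \\ T^{-\delta}, & \text{for } b \ll B^{1/2}. \end{cases}$$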


Here $\delta > 0$ depends on the size of $\varepsilon > 0$, where $B \gg T^{\varepsilon}$. The first bound follows by using
a Mellin transform, and the Vinogradov-Korobov bound (Lemma 2.1). The latter bound
comes from [GK, Theorem 2.9]. Of course, we also have the trivial bound $\alpha_b \ll 1$ which is
the best we may claim for $b \gg B(\log T)^{-2/3}$. One easy deduction from (5.55) is that

(5.56) $$\sum_{b \ll B} \frac{|\alpha_b|^2}{b} \ll \log\log T,$$

coming from the range where we only may claim the trivial bound on $\alpha_b$.
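
To see why the range with only the trivial bound costs no more than $\log\log T$, note that a harmonic sum over $B/L \leq b \leq B$ is about $\log L$, and here $L$ is a fixed power of $\log T$. The short Python sketch below illustrates this numerically; the values of `B` and `L` are hypothetical, and the code only checks the elementary estimate $\sum_{B/L \leq b \leq B} 1/b \approx \log L$, not the bounds on $\alpha_b$ themselves.

```python
import math


def harmonic_range(lo, hi):
    # Sum of 1/b over lo <= b <= hi
    return sum(1.0 / b for b in range(lo, hi + 1))


B = 10**6  # hypothetical size of B
checks = []
for L in (5, 20, 100):  # L plays the role of a power of log T
    s = harmonic_range(B // L, B)
    checks.append((L, s, math.log(L)))

for L, s, target in checks:
    print(L, s, target)
```

With $L = (\log T)^{2/3+\varepsilon}$ this gives $\log L \asymp \log\log T$, which is the source of the $\log\log T$ in the deduction above.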
Next we open the square and perform the integration. For notational simplicity, we drop
the max over T' by assuming that T' is chosen to maximize the expression. Then we obtain


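(5.57) $$I_{A,B} \ll \Delta \sum_{\substack{(b_1,q_1)=1 \\ |x - \frac{q_1}{b_1}| \asymp M^{-1}\Delta}} \frac{\alpha_{b_1} b_1^{-2iT'}}{(b_1(q_1-b_1x))^{1/2}} \sum_{\substack{(b_2,q_2)=1 \\ |x - \frac{q_2}{b_2}| \asymp M^{-1}\Delta}} \frac{\overline{\alpha_{b_2}}\, b_2^{2iT'}}{(b_2(q_2-b_2x))^{1/2}} \widehat{w_3}\Big(\frac{\Delta}{2\pi} \log\Big(\frac{b_2(q_1-b_1x)}{b_1(q_2-b_2x)}\Big)\Big),$$

plus $O(M\Delta^{-2})$. We give up any cancellation, and bound everything with absolute values.
It is easy to see that under the assumption (5.48), we have

(5.58) $$\widehat{w_3}\Big(\frac{\Delta}{2\pi} \log\Big(\frac{b_2(q_1-b_1x)}{b_1(q_2-b_2x)}\Big)\Big) \ll \Big(1 + M\Big|\frac{q_1}{b_1} - \frac{q_2}{b_2}\Big|\Big)^{-10}.$$

Therefore, we have

(5.59) $$I_{A,B} \ll M \sum_{\substack{(b_1,q_1)=1 \\ |x - \frac{q_1}{b_1}| \asymp M^{-1}\Delta}} \sum_{\substack{(b_2,q_2)=1 \\ |x - \frac{q_2}{b_2}| \asymp M^{-1}\Delta}} \frac{|\alpha_{b_1} \alpha_{b_2}|}{b_1 b_2} \Big(1 + M\Big|\frac{q_1}{b_1} - \frac{q_2}{b_2}\Big|\Big)^{-10} + M\Delta^{-2}.$$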


Consider the diagonal contribution to (5.59), that is, the terms with $q_1 = q_2$ and $b_1 = b_2$.
These terms contribute to $I_{A,B}$ an amount that is

(5.60) $$\ll M \sum_{b \ll B} \frac{|\alpha_b|^2}{b^2} \Big(1 + \frac{\Delta b}{M}\Big).$$

Using (5.56), the part with $\frac{\Delta b}{M}$ contributes $O(\Delta \log\log T)$. On the other hand, with the
trivial bound $\alpha_b \ll 1$ for $b \gg B^{1/2}$, and with $\alpha_b \ll T^{-\delta}$ for $b \ll B^{1/2}$, we obtain that
$M \sum_{b \ll B} b^{-2} |\alpha_b|^2 = O(MT^{-\delta})$, for some $\delta > 0$. Thus the diagonal terms are bounded in
accordance with Lemma 5.6.

Now consider the off-diagonal terms where $q_1/b_1 \neq q_2/b_2$. Suppose that $|\frac{q_1}{b_1} - \frac{q_2}{b_2}| \ll M^{-1}$
(the more general case where the difference is $\asymp CM^{-1}$ for $C \gg 1$ is similar, and the
decay of $\widehat{w_3}$ more than compensates for the extra terms, so we omit the proof for this case).
Further restrict to say $b_1 \asymp X_1$, and $b_2 \asymp X_2$, where the $X_i$ range over dyadic segments,
with $1 \ll X_i \ll B$. We claim the number of solutions in $(q_1, b_1)$ to (5.48) with $b_1 \asymp X_1$
is $\ll 1 + \frac{\Delta X_1^2}{M}$. To see this, note that the term “1” accounts for a potential solution (this
certainly cannot be improved since given $q, b$ there exist $x$ satisfying (5.48)). Once such a
solution is found, we can estimate the number of others by noting that the spacing between
two reduced fractions of denominator of size $X_1$ is $\gg X_1^{-2}$, so the total number of such
fractions that can be packed into an interval of length $L$ is bounded from above by $1 + LX_1^2$.
Similarly, for each choice of $(q_1, b_1)$, the number of $(q_2, b_2)$ with $0 < |\frac{q_1}{b_1} - \frac{q_2}{b_2}| \ll M^{-1}$
is $\ll \frac{X_1 X_2}{M}$. Let $\alpha_X = \max_{b \asymp X} |\alpha_b|$. Similarly to (5.56), the bounds (5.55) imply that
$\sum_{1 \ll X \ll B} \alpha_X^2 \ll \log\log T$. Thus the bound on the off-diagonal terms is


(5.61) < M 

x 1: x 2 

which completes the proof. 


AX^a^ax.X.X, ^ 

+ M ) X ,X 2 W~ 


log log T + 


AB 2 


log log T, 
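
The counting step in the off-diagonal argument rests on the spacing of reduced fractions: two distinct reduced fractions with denominators in $[X, 2X)$ differ by more than $1/(4X^2)$, so an interval of length $L$ contains at most $1 + 4LX^2$ of them. The Python sketch below checks this on one example with exact rational arithmetic. The values of `X`, `left` and `length` are hypothetical, and the explicit constant 4 is an artifact of the dyadic range chosen here rather than something stated in the text.

```python
from fractions import Fraction
from math import ceil, floor, gcd


def reduced_fractions(X, left, length):
    """Reduced fractions q/b with X <= b < 2X lying in [left, left + length]."""
    found = set()
    for b in range(X, 2 * X):
        lo = ceil(left * b)
        hi = floor((left + length) * b)
        for q in range(lo, hi + 1):
            if gcd(q, b) == 1:
                found.add(Fraction(q, b))
    return sorted(found)


X = 50                                # hypothetical dyadic size of the denominators
left = Fraction(314159, 1000000)      # hypothetical left endpoint of the interval
length = Fraction(1, 100)             # hypothetical interval length L

fracs = reduced_fractions(X, left, length)
gaps = [s - r for r, s in zip(fracs, fracs[1:])]
min_gap = min(gaps)
bound = 1 + 4 * length * X * X

print(len(fracs), float(min_gap), float(bound))
```

In the proof the interval has length $L \asymp \Delta/M$, which gives the count $\ll 1 + \Delta X_1^2/M$ used above.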
5.5. Proof of Lemma 5.7. When $b$ is very small (and so $a$ is very large) then we treat the
problem in a similar way to the early steps of [DFI2]. We go back to the original definition
(5.3), open the square, and execute the integral, obtaining
(5.62) 



Our plan is to solve the equation $a_1b_1 - a_2b_2 = h \neq 0$ via Poisson summation in $a_2 \pmod{b_1}$.
The contribution from $h = 0$ is $O(\Delta \log T)$, by a trivial bound. First we extract the greatest
common divisor $d = (b_1, b_2)$ which necessarily divides $h$. In this way we obtain


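(5.63) $$\Delta \sum_{h \neq 0} e(hx) \sum_{d | h} \sum_{(b_1,b_2)=1} \frac{g_0\big(\frac{db_1}{B_0}\big) g_0\big(\frac{db_2}{B_0}\big)}{(db_1)^{1/2+2iT} (db_2)^{1/2-2iT}} R(b_1, b_2, h),$$

where

(5.64) $$R(b_1, b_2, h) = \sum_{a_1b_1 - a_2b_2 = \frac{h}{d}} \frac{w_2\big(\frac{da_1b_1}{M}\big) w_2\big(\frac{da_2b_2}{M}\big)}{(a_1a_2)^{1/2}} \widehat{w_1}\Big(\frac{\Delta}{2\pi} \log\Big(1 + \frac{h/d}{a_2b_2}\Big)\Big).$$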


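We next re-interpret the equation $a_1b_1 - a_2b_2 = \frac{h}{d}$ as a congruence $a_2 \equiv -\overline{b_2} \frac{h}{d} \pmod{b_1}$, and
replace every occurrence of $a_1$ with $\frac{\frac{h}{d} + a_2b_2}{b_1}$. Therefore by Poisson summation in $a_2 \pmod{b_1}$,
we have

(5.65) $$R(b_1, b_2, h) = \frac{1}{b_1} \sum_{l \in \mathbb{Z}} e\Big(-\frac{l\, \overline{b_2}\, (h/d)}{b_1}\Big) \int_0^{\infty} \frac{w_2\big(\frac{h+drb_2}{M}\big) w_2\big(\frac{drb_2}{M}\big)}{\big(r\, \frac{h+drb_2}{db_1}\big)^{1/2}} \widehat{w_1}\Big(\frac{\Delta}{2\pi} \log\Big(1 + \frac{h/d}{rb_2}\Big)\Big) e\Big(-\frac{lr}{b_1}\Big) dr.$$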


Here the integral is of the form q[y) = f q(r)e(—ry)dr where q is a function supported on 
r ~ £ « f. satisfying ,«(r) « (£)-', and with 9 = A Thus 9 ( 9 ) «, Now 


we have, with y = i/ 61 , that 


(5.66) 


\y\M 

db 2 


m 

dbib 2 


» |(| w » 


and since we assume M ^ T 1 / 2 ^, and a < 1/4, if l 7 ^ 0, then q(l/b\) is very small. For this 
reason, we only need to consider / = 0 . 
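
The Poisson summation step over a residue class can be illustrated on a model weight. The Python sketch below is not the weight function of the paper: it uses the Gaussian $F(r) = \exp(-\pi r^2/R^2)$, whose Fourier transform $\widehat{F}(y) = \int F(r) e(-ry)\,dr = R \exp(-\pi R^2 y^2)$ is explicit, and checks the identity $\sum_{a \equiv c \,(\mathrm{mod}\, b)} F(a) = \frac{1}{b} \sum_{l} e(lc/b) \widehat{F}(l/b)$. The parameters `c`, `b`, `R` are hypothetical. It also shows that once the length $R$ of the sum is large compared to the modulus $b$, the zero frequency alone reproduces the sum, which mirrors the reduction to the zero frequency made above.

```python
import cmath
import math


def progression_sum(c, b, R, K=2000):
    # Sum of F(a) = exp(-pi * (a/R)^2) over a = c + b*k, |k| <= K
    return sum(math.exp(-math.pi * ((c + b * k) / R) ** 2) for k in range(-K, K + 1))


def dual_sum(c, b, R, lmax=50):
    # (1/b) * sum over |l| <= lmax of e(l*c/b) * Fhat(l/b), Fhat(y) = R*exp(-pi*R^2*y^2)
    total = 0
    for l in range(-lmax, lmax + 1):
        fhat = R * math.exp(-math.pi * (R * l / b) ** 2)
        total += cmath.exp(2j * math.pi * l * c / b) * fhat
    return total / b


c, b = 3, 7  # hypothetical residue class c modulo b

# Moderate length: nonzero frequencies still matter, but the identity holds
short_lhs = progression_sum(c, b, R=10.0)
short_rhs = dual_sum(c, b, R=10.0)

# Length much larger than the modulus: the l = 0 term R/b already gives the sum
long_lhs = progression_sum(c, b, R=40.0)
zero_frequency = 40.0 / b

print(short_lhs, short_rhs, long_lhs, zero_frequency)
```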

Remark. Although there is a possibility for further analysis of the terms with $l \neq 0$, I
have so far been unable to improve on Proposition 4.1 using this approach. The reason is
that Lemmas 5.5 and 5.6 already indicate that the important ranges are $A = \Delta^{1/2+o(1)}$, and
$B = T^{1/2+o(1)}$. The results in the literature on this problem ([DFI1] and [BCT]) are nontrivial
if $A$ is not much smaller than $B$. This means that $\Delta$ needs to be $\gg T^{1-\delta}$ for some small $\delta > 0$
(the recent work of Bettin and Chandee [BC] perhaps allows any $\delta < 1/14$). Meanwhile, in
Section 3 we have given strong estimates in the range $\Delta \gg T^{25/27+\varepsilon}$ using spectral methods.
Now we return to the proof. The terms with $h \neq 0$ contribute to $I_{B_0}$ an amount

(5.67) $$\Delta \sum_{h \neq 0} e(hx) \sum_{d | h} \sum_{(b_1,b_2)=1} \frac{g_0\big(\frac{db_1}{B_0}\big) g_0\big(\frac{db_2}{B_0}\big)}{(db_1)^{1/2+2iT} (db_2)^{1/2-2iT}} \frac{1}{b_1} \int_0^{\infty} \frac{w_2\big(\frac{h+drb_2}{M}\big) w_2\big(\frac{drb_2}{M}\big)}{\big(r\, \frac{h+drb_2}{db_1}\big)^{1/2}} \widehat{w_1}\Big(\frac{\Delta}{2\pi} \log\Big(1 + \frac{h/d}{rb_2}\Big)\Big) dr,$$

plus a small error of size $O(T^{-100})$. By changing variables $r \to \frac{r}{db_2}$ we obtain that (5.67)
equals

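(5.68) $$\Delta \sum_{h \neq 0} e(hx) \sum_{d | h} \frac{1}{d} \sum_{(b_1,b_2)=1} \frac{g_0\big(\frac{db_1}{B_0}\big) g_0\big(\frac{db_2}{B_0}\big)}{b_1^{1+2iT} b_2^{1-2iT}} \int_0^{\infty} \frac{w_2\big(\frac{r}{M}\big) w_2\big(\frac{r+h}{M}\big)}{(r(r+h))^{1/2}} \widehat{w_1}\Big(\frac{\Delta}{2\pi} \log\Big(1 + \frac{h}{r}\Big)\Big) dr.$$

At this point, the integral is independent of $b_1$ and $b_2$. Note that if we restrict attention to
$d > D \geq 1$, then by a trivial bound, we obtain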


(5.69) 


\1 Af 9 iW 9 

A E^>°s 2 B o«-iog 2 r. 


d>D 


D 


We set $D = B_0^{1/2}$, so that this error term is satisfactory.
Next we claim that if $d \ll D = B_0^{1/2}$, then


(5.70) $$\sum_{(b_1,b_2)=1} \frac{g_0\big(\frac{db_1}{B_0}\big) g_0\big(\frac{db_2}{B_0}\big)}{b_1^{1+2iT} b_2^{1-2iT}} = \frac{|\zeta(1+2iT)|^2}{\zeta(2)} + O_c((\log T)^{-c}),$$

where $c > 0$ is arbitrarily large. For this, we simply apply a double Mellin transform,
obtaining that the left hand side of (5.70) equals


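(5.71) $$\Big(\frac{1}{2\pi i}\Big)^2 \int_{(1)} \int_{(1)} \Big(\frac{B_0}{d}\Big)^{s_1+s_2} \widetilde{g_0}(s_1) \widetilde{g_0}(s_2) \frac{\zeta(1+2iT+s_1) \zeta(1-2iT+s_2)}{\zeta(2+s_1+s_2)} ds_1 ds_2.$$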


Integrating by parts once, we have that for $\mathrm{Re}(s) > 0$,

(5.72) $$\widetilde{g_0}(s) = -\frac{1}{s} \int_0^{\infty} g_0'(x) x^s dx.$$

Since $g_0$ is identically 1 for $0 < x < 1$, $\widetilde{g_0}$ has a pole at $s = 0$ (and nowhere else). Furthermore,
$\mathrm{Res}_{s=0} \widetilde{g_0}(s) = -\int_0^{\infty} g_0'(x) dx = 1$. We move the $s_1$ integral to $\sigma_1 = -\frac{a}{(\log T)^{2/3}}$, some $a > 0$,
followed by the same process for the $s_2$ integral. By the rapid decay of $\widetilde{g_0}$, the residues at
$s_1 = -2iT$ and $s_2 = 2iT$ are very small. Using Lemma 2.1 (Vinogradov-Korobov), we obtain
that the left hand side of (5.70) equals

(5.73) IC<1 C(2 2 ) T)I + O(0 ° g T)l ' 3 6XP ( ~ P (i^N ’ 


for some constant $\beta > 0$. Since we assume $B_0 \gg T^{\varepsilon}$, this error term is $O((\log T)^{-c})$ for $c > 0$
arbitrary.
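
The shape of the main term in (5.70) comes from the Dirichlet series identity $\sum_{(b_1,b_2)=1} b_1^{-s_1} b_2^{-s_2} = \zeta(s_1)\zeta(s_2)/\zeta(s_1+s_2)$, valid for $\mathrm{Re}(s_i) > 1$. As a hedged numerical illustration (the truncation point `N` is an arbitrary choice, and real exponents are used in place of $1 \pm 2iT$ so that the series converges absolutely), the following Python sketch compares a truncated coprime double sum at $s_1 = s_2 = 2$ with $\zeta(2)^2/\zeta(4) = 5/2$.

```python
from math import gcd, pi


def coprime_double_sum(s1, s2, N):
    # Truncated sum over 1 <= b1, b2 <= N with gcd(b1, b2) = 1
    return sum(
        b1 ** -s1 * b2 ** -s2
        for b1 in range(1, N + 1)
        for b2 in range(1, N + 1)
        if gcd(b1, b2) == 1
    )


N = 400  # hypothetical truncation point
truncated = coprime_double_sum(2, 2, N)
zeta2 = pi ** 2 / 6
zeta4 = pi ** 4 / 90
exact = zeta2 ** 2 / zeta4  # equals 5/2

print(truncated, exact)
```

The truncated value lies slightly below the exact one, since the omitted terms are all positive.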
Applying (5.70) to (5.68) and extending the $d$-sum back to all the divisors of $h$ (without
making a new error term), we obtain for arbitrary $c > 0$

(5.74) 


lBo ~ A 


IC(l + 2 iT)| : 
C(2) 


/»oo 

/ a_i(h)e(hx) 
o 


W 2 ImI ®2 


M ) 


(r(r + h )) 1 / 2 




A log (l-A))* 


(A 


AT 


(logT)^ 

At this point we can use Taylor expansions, (essentially we think of jj -C even though 

this is not literally true), giving now 

(5.75) 


I Bo — A 


lC(l + 2 iT)Y 
C(2) 


w 2 {ry 


Ya_ l{ h)e(hx)vrA-^-)dr + 0( 'T 1 A A 
^ w v ' V 2nMr ) Vmin(A, (logT) 50 ) 


M|C(l + 2zT)h 


By comparison to the expression for $MT_{od}$ given by (4.26), we have shown that these terms
equal $M \cdot MT_{od}$ plus a satisfactory error term. Since we already showed (4.24), the proof
here is now complete.

6. The smallest $\Delta$ regime
In this section we show Proposition 1.6, that is,

(6.1) $$\int_{|t-T| \ll T^{1/100}} |F(it)|^2 dt \ll (\log T)^{\varepsilon}.$$

The previous methods do not cover this range because Stirling’s formula is no longer
applicable. Instead, we go back to the original definition (2.7). Obviously,


( 6 . 2 ) 


|A(l/2 + it ± iT)\ 2 dt < 1, 


so that by a use of Cauchy-Schwarz, we have 


(6.3) 

where 

(6.4) 

and where 


' |t-T|«T 1/10 ° 


K(A,T,x) = 


\F{it)\ 2 dt 1 + 


|C(1 + 27T)|; 




A <T 1 / 100 


E- 

'\t-T\xA 1 n>1 


T iT (n)e(nx) 


y^l/2 -\-it 


k(n, f, T ) 


dt , 


(6.5) 


k(n : t, T ) = 


2n i 


(°0 


y>(—u)n u cosh(7rT/2)7vy (1/2 + it + u)du, a > 


When $\Delta \asymp 1$, we need to re-interpret $|t - T| \asymp \Delta$ as $|t - T| \ll 1$ (we do not make new
notation for this range since we will shortly extend this range to $|t - T| \ll (\log T)^{100}$).

Next we split the $n$-sum into three ranges: $n \ll N_0^-$, $N_0^- \ll n \ll N_0^+$, and $n \gg N_0^+$, where

N o = YYff’ iVo + = (AU 1/2 (iogr)*. 


( 6 . 6 ) 
Let us call these three intervals $\mathcal{N}_i$, $i = 1,2,3$, respectively. To simplify the analysis, we
suppose that these three ranges are detected by a smooth partition of unity, say $\gamma_1 + \gamma_2 + \gamma_3 = 1$,
with $\gamma_i$ respective to $\mathcal{N}_i$.

Our goal is to relate $K(\Delta, T, x)$ to expressions to which we may apply the results of Section
5. To this end, we have


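Lemma 6.1. Suppose that $\Delta \gg (\log T)^{100}$. Then

(6.7) $$K(\Delta, T, x) \ll \int_{|t-T| \asymp \Delta} (\Delta T)^{-1/2} \Big| \sum_{N_0^- \ll n \ll N_0^+} \frac{\tau_{iT}(n) e(nx) \gamma_2(n)}{n^{1/2+it}} \Big|^2 dt + \frac{|\zeta(1+2iT)|^2}{(\log T)^2}.$$

If $\Delta \ll (\log T)^{100}$, then

(6.8) $$K(\Delta, T, x) \ll_{\varepsilon} T^{-1/2} \int_{|t-T| \ll (\log T)^{100}} \Big| \sum_{N_0^- \ll n \ll N_0^+} \frac{\tau_{iT}(n) e(nx) \gamma_2(n)}{n^{1/2+it}} \Big|^2 dt + \frac{|\zeta(1+2iT)|^2}{(\log T)^2}.$$

The error terms here contribute $\ll (\log T)^{-1}$ to (6.1), after summing over the $O(\log T)$
values of $\Delta$ in (6.3), which is satisfactory.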


Proof. By Cauchy’s inequality, it suffices to bound the three ranges $\mathcal{N}_i$ separately. For
$n \ll N_0^-$ (that is, $i = 1$), we take $\sigma = -1/4$, and for $n \gg N_0^+$ ($i = 3$), we take $\sigma = 1/\varepsilon^2$
(large). In each $n$-range, we then move the $u$-integral outside and apply Cauchy-Schwarz in
the following form (similarly to (4.9) above):


(6.9) 


E 

nGA/i 


T iT {n)e{nx)^i(n) 1 


y^l /2 + Zt 

< I 


2ni 


(+ 


u cosh( 7 rT/ 2 ) 7 y T (l /2 + it + u)du 


dt 


t L X J V 


(—u)| cosh( 7 rT)| 7 y T (l /2 + it + v)\ 2 dv 

T iT {n)e{nx)7i{n\ 2 


([ m- 

-«)ll 

^ J u 

* 


E 

neAfi 


/ 2-\-it-\-u 


du 


dt. 


We next argue that 


( 6 . 10 ) 


(—w)| cosh( 7 rT)| 7 y T (l /2 + it + v)\ 2 dv (1 + \t - T\) a 2 (1 + |t + T|) <t 2 . 


By Stirling, the left hand side above is bounded by 


( 6 . 11 ) 


{-a-iy)\(l + \t + T+ y\) a 2 (1 + |f — T + y\) a 2 dy. 


The main range of the integral is \y\ < 1 + \\t — T\, which by a trivial bound leads to the 

claimed bound. If \y\ > 1 + \\t — T\, then the rapid decay of can be used to show that 
this part of the integral is negligible by comparison. Hence 

3 ' ,0 ° Ti T (n)e(nx)^ii(n) 2 


(6.12) iL(A, T, x) « ^ [ (AT)^ T \^- a -iy)\\j2- 

i= l U rr.\. _A neMi 


n 


1/2 +(Ti+it.+iy 


dydt. 


\t-T\xA 


To focus on the important parts here, we shall apply the mean value theorem for Dirichlet
polynomials (combined with Lemma 2.4) in some ranges where this is acceptable. In (6.12),
with $i = 1$ (so $\sigma_1 = -1/4$), we have the bound

(6.13) $$(\Delta T)^{-3/4} \sum_{n \ll N_0^-} \frac{|\tau_{iT}(n)|^2}{n^{1/2}} (\Delta + n) \ll \frac{|\zeta(1+2iT)|^2 \log T}{(\Delta T)^{3/4}} \big(\Delta (N_0^-)^{1/2} + (N_0^-)^{3/2}\big) \ll \frac{|\zeta(1+2iT)|^2}{(\log T)^2},$$

consistent with Lemma 6.1. A similar (even stronger) bound holds for $\mathcal{N}_3$, and we omit
the details as it is even easier than the above case (it saves an arbitrary power of $\log T$).
Therefore,


(6.14) 


K(A, T, x) <C J (AT)-5 

|t—T|xA 


IV’Hz/)! 


E 

Y 0 <n<7V+ 


Tj T (n)e(na;)72(n) 


2 

dydt 


K(1 + 2fT )| 2 
(logT ) 2 


If $\Delta \gg (\log T)^{100}$, then we can use an argument as in the proof of Lemma 4.2 to truncate
the $y$-integral at $\varepsilon \Delta$, and then over-extend the $t$-integral slightly. The error term obtained
in this way saves a large power of $\Delta$ which is acceptable for these values of $\Delta$. This proves
(6.7).

Next suppose $\Delta \ll (\log T)^{100}$. In this situation, we truncate the $y$-integral at $(\log T)^{\varepsilon}$,
which by the rapid decay of introduces an error term that saves an arbitrarily large power 
of logT over the trivial bound, which is satisfactory. We then use a simple over-estimate in 
the following shape: 


(6.15) 



, |t-T|<(logT) 1 oo 


\f(t)\ 2 dt. 


At this point, the $t$-integral is longer than the $y$-integral, so we can again change variables
$t \to t - y$ and double the length of $t$-integration to separate the variables $t$ and $y$. This
proves (6.8). □


Now we are almost in a position to apply the results from Section 5 to prove (6.1). First
suppose $\Delta \gg (\log T)^{100}$. We need to apply a dyadic partition of unity inside the $n$-sum, and
apply Cauchy’s inequality to focus on one dyadic segment at a time. Since $\log(N_0^+/N_0^-) \ll
\log\log T$, there are only $O(\log\log T)$ such dyadic segments to consider. Then Lemmas 5.4
and 5.7 show

(6.16) $$K(\Delta, T, x) \ll |\zeta(1+2iT)|^2 (\log T)^{\varepsilon} \big((\log T)^{-50} + S_{\Delta}\big),$$

which is satisfactory (recall that 1)- The factor (logT) £ comes about because 

(log ry. 

Next suppose $\Delta \ll (\log T)^{100}$. Again we can apply Lemmas 5.4 and 5.7 to obtain the
same bound as (6.16); however, there is a technical difficulty in showing $S_{\Delta} \ll 1$ here,
because in Lemma 4.3 we assumed that $w_1$ has support on the positive reals. Here the weight
function attached to the $t$-integral has compact support on $\mathbb{R}$, but not $(0, \infty)$. Nevertheless,
it is easy to show that $Q(x, H) \ll 1$ for an individual $H \gg 1$ and $w_1$ a fixed Schwartz-class
function, which in this context means $S_{\Delta} \ll 1$. Since there are only $O(\log\log T)$ dyadic
values of $\Delta$ with $\Delta \ll (\log T)^{100}$, this bound is satisfactory for (6.1).
7. Upper bounds


7.1. Proofs. In this section, we prove Theorem 1.2. First we recall two “endpoint” bounds.
One, where $|t - T| \ll T^{1/100}$, is given by Proposition 1.6. For the other end, combining
Propositions 3.1 and 4.1 gives

(7.1) $$\int_{|t| \leq T - CT^{26/27}} |F(it)|^2 dt \ll \log T.$$

Next we consider the intermediate ranges $T^{1/100} \ll \Delta \ll T^{26/27}$. By Lemma 4.2, we need
to bound

(7.2) $$\sum_{\substack{T^{\varepsilon} \ll \Delta \ll T^{1-\varepsilon} \\ \text{dyadic}}} |\zeta(1+2iT)|^{-2} (\Delta T)^{-1/2} I(\Delta, T, x, N).$$

The mean value theorem for Dirichlet polynomials (Lemma 2.2), combined with Lemma 2.4,
gives that (7.2) is

(7.3) $$\ll \sum_{\substack{T^{\varepsilon} \ll \Delta \ll T^{1-\varepsilon} \\ \text{dyadic}}} |\zeta(1+2iT)|^{-2} (\Delta T)^{-1/2} \sum_{n \asymp (\Delta T)^{1/2}} |\tau_{iT}(n)|^2 \Big(\frac{\Delta}{n} + 1\Big) \ll \sum_{\substack{T^{\varepsilon} \ll \Delta \ll T^{1-\varepsilon} \\ \text{dyadic}}} \log T \ll \log^2 T.$$

This proves (1.8). To show (1.9), we instead apply Corollary 5.9.
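
For readers who wish to experiment with the mean value theorem for Dirichlet polynomials used here, the following Python sketch evaluates $\int_0^T |\sum_{n \leq N} a_n n^{-it}|^2 dt$ exactly by integrating each cross term in closed form, and compares it with the diagonal $T \sum |a_n|^2$. The coefficients $a_n = n^{-1/2}$ and the values of `N` and `T` are hypothetical stand-ins, not the coefficients $\tau_{iT}(n) e(nx)$ of the paper. The explicit constant $3\pi$ in the comparison is the one usually derived from the Montgomery-Vaughan form of Hilbert's inequality; it is quoted here as an assumption rather than taken from the text.

```python
import cmath
import math


def mean_square(a, T):
    """Exact value of the integral of |sum_n a_n n^{-it}|^2 over 0 <= t <= T."""
    N = len(a)
    total = T * sum(abs(c) ** 2 for c in a)  # diagonal terms m = n
    for m in range(1, N + 1):
        for n in range(1, N + 1):
            if m != n:
                lam = math.log(n / m)
                # Integral of exp(i*t*lam) over [0, T]
                osc = (cmath.exp(1j * T * lam) - 1) / (1j * lam)
                total += a[m - 1] * a[n - 1].conjugate() * osc
    return total.real


N, T = 40, 100.0  # hypothetical length and integration range
a = [complex(1.0 / math.sqrt(n), 0.0) for n in range(1, N + 1)]

integral = mean_square(a, T)
diagonal = T * sum(abs(c) ** 2 for c in a)
# Off-diagonal allowance of the form O(sum n |a_n|^2), with an assumed constant 3*pi
allowance = 3 * math.pi * sum((n + 1) * abs(a[n - 1]) ** 2 for n in range(1, N + 1))

print(integral, diagonal, allowance)
```

The point of the comparison is that the integral equals $\sum |a_n|^2 (T + O(n))$, which is the source of the factor $\Delta + n$ (equivalently $\frac{\Delta}{n} + 1$ after normalization) in (7.3).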


7.2. Remarks on the sieve method for shifted convolution sums. Here we give an
informal discussion of estimates for $I(\Delta, T, x, N)$ using Holowinsky’s sieve method [Ho].

Consider the range $T^{\varepsilon} \ll \Delta \ll T^{1-\varepsilon}$ (we could restrict $\Delta$ further to $T^{1/2-\delta} \ll \Delta \ll
T^{25/27+\varepsilon}$, but this is not helpful for this discussion). By (4.12), we have

(7.4) $$I(\Delta, T, x, N) \ll \frac{\Delta}{N} \sum_{h} \Big(1 + \frac{|h| \Delta}{N}\Big)^{-100} \sum_{n \asymp N} |\tau_{iT}(n) \tau_{iT}(n+h)|.$$

Recall we desire to estimate (7.2). The term $h = 0$ gives a term of size $\Delta |\zeta(1+2iT)|^2 \log T$,
which is more than acceptable, so we focus on h > 1. By m Theorem 15.6] which gives a 
slightly different version of a result of Holowinsky ra, we have 


(7.5) 


E EtMietE+ h )\« 

nxIV 


J^-NE iT {N) 2 L iT (N)\ 


where

(7.6) $$E_{iT}(x) = \frac{1}{\log x} \prod_{p \leq x} \Big(1 + \frac{|\tau_{iT}(p)|}{p}\Big),$$


and L iT (N) -C log log A. One can show 

(7.7) $$E_{iT}(N) \ll (\log T)^{1/3} |\zeta(1+2iT)|^{7/9} |\zeta(1+4iT)|^{-1/9}.$$

To see this, we begin with the inequality

(7.8) $$|\tau_{iT}(p)| \leq \frac{1}{18}\big(8 + 11|\tau_{iT}(p)|^2 - |\tau_{iT}(p)|^4\big).$$

Thus

(7.9) $$|\tau_{iT}(p)| \leq \frac{1}{18}\big(24 + 7(p^{2iT} + p^{-2iT}) - (p^{4iT} + p^{-4iT})\big),$$
which implies

(7.10) $$E_{iT}(x) \ll (\log x)^{-1} \prod_{p \leq x} \Big(1 + \frac{1}{p}\Big)^{4/3} \prod_{p \leq x} \Big|1 + \frac{1}{p^{1+2iT}}\Big|^{7/9} \prod_{p \leq x} \Big|1 + \frac{1}{p^{1+4iT}}\Big|^{-1/9}.$$

Of course, $\prod_{p \leq x} (1 + p^{-1}) \asymp \log x$. It follows from standard methods from Davenport’s book
and (2.5) that

(7.11) $$\sum_{p \leq x} \frac{1}{p^{1+2iT}} = \log \zeta(1+2iT) + O(1),$$

provided $\log x \gg (\log T)^{2/3+\varepsilon}$. This shows (7.7).
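
The polynomial inequality behind (7.7) can be checked directly. The Python sketch below assumes the standard normalization $\tau_{iT}(p) = p^{iT} + p^{-iT}$, so that $y = |\tau_{iT}(p)| \in [0, 2]$. It verifies on a rational grid that $8 + 11y^2 - y^4 - 18y = (y-1)^2 (y+4)(2-y) \geq 0$, checks the trigonometric expansion leading from (7.8) to (7.9) with $\tau = 2\cos\theta$, and recomputes the exponents $1/3$, $7/9$, $-1/9$ that appear in (7.7). The grid sizes are arbitrary choices.

```python
from fractions import Fraction
import math


def gap(y):
    # Right side minus left side of 18*y <= 8 + 11*y^2 - y^4
    return 8 + 11 * y ** 2 - y ** 4 - 18 * y


def gap_factored(y):
    # Claimed factorization, manifestly nonnegative for 0 <= y <= 2
    return (y - 1) ** 2 * (y + 4) * (2 - y)


grid = [Fraction(k, 1000) for k in range(0, 2001)]
inequality_holds = all(gap(y) == gap_factored(y) and gap(y) >= 0 for y in grid)


def expansion_error(theta):
    # With tau = 2*cos(theta): 8 + 11*tau^2 - tau^4 = 24 + 7*(2cos 2theta) - (2cos 4theta)
    tau = 2 * math.cos(theta)
    lhs = 8 + 11 * tau ** 2 - tau ** 4
    rhs = 24 + 7 * (2 * math.cos(2 * theta)) - 2 * math.cos(4 * theta)
    return abs(lhs - rhs)


max_err = max(expansion_error(0.01 * k) for k in range(0, 700))

# Exponents of log x, |zeta(1+2iT)| and |zeta(1+4iT)| produced by (7.9)
exponents = (Fraction(24, 18) - 1, 2 * Fraction(7, 18), -2 * Fraction(1, 18))

print(inequality_holds, max_err, exponents)
```

Equality in (7.8) holds at $y = 1$ and $y = 2$, which is visible from the factorization.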

Therefore, we have 

, 7m i(a,t,x,n) (log r) 1/3+> 

y 1 |C(l + 2iT)| 2 (AT) 1 / 2 |C(l + 2iT)| 11 / 9 |C(l + 4jT)| 1 / 9 ' 

The problem with using this is that the best bound for the $\zeta$’s that we may claim is $|\zeta(1 +
it)|^{-1} \ll (\log |t|)^{2/3+\varepsilon}$ which gives a weaker bound than that in (7.3). Alternatively, one may
say that $|\zeta(1 + it)|^{-1} \ll (\log\log |t|)$ for almost all $t$, but then one arrives at a bound worse
than (1.9).

This line of thought indicates that one should choose a different polynomial inequality
than (7.8), in order to optimize the total power of $\log T$ that occurs on the right hand side
of (7.12). We performed a computer search for the optimal choice available from the bounds
$|\zeta(1 + it)|^{\pm 1} \ll (\log |t|)^{\eta+\varepsilon}$, with $\eta = 2/3$ and with polynomials of degree $\leq 6$, and found
no improvement on (7.3). Numerical experiments indicated that an improvement on (7.3) is
possible with $\eta < 1/2$, but in this case (1.9) also improves on (1.8).


8. Rational $x$: the proof of Proposition 1.7


As mentioned in the remark following Proposition 1.6, to show Proposition 1.7 all that is
required is to show


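(8.1) $$\int_{T^{1/2-\varepsilon} \leq |t-T| \leq T^{25/27+\varepsilon}} |F(it)|^2 dt = o(\log T),$$

for $|x - x_0| \leq \ldots$ and $x_0 \in \mathbb{Q}$. By Lemma 4.2, it suffices to show

(8.2) $$\sum_{T^{.49} \ll \Delta \ll T^{.94}} \frac{I(\Delta, T, x, N)}{|\zeta(1+2iT)|^2 (\Delta T)^{1/2}} \ll 1.$$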


Using the facts that n x N = (AT ) 1 / 2 <C T 97 , and 7 is bounded by a small power of T, we 
have $e(nx) = e(nx_0) + O(T^{-.01})$, for all $n, x$ appearing in (8.2). Therefore, it suffices to show
(8.2) for $x$ replaced by $x_0$.

In the course of proof, we shall need the following 


Proposition 8.1 (Jutila-Motohashi [JM]). Let $1 \ll U \ll V$, and let $q$ be a positive integer.
Then

(8.3) $$\int_{V \leq |u| \leq V+U} \sum_{\chi \pmod{q}} |L(1/2 + iu, \chi)|^4 du \ll q^{1+\varepsilon} (U + V^{2/3})^{1+\varepsilon}.$$

This is a generalization of a result of Iwaniec [Iw]. We also require the following change
of basis formula.
Lemma 8.2. Suppose that $c_n$ is a finite sequence of complex numbers, $q \geq 1$ is an integer,
and $(a, q) = 1$. Then

(8.4) $$\sum_{n} c_n e\Big(\frac{an}{q}\Big) = \sum_{d | q} \frac{1}{\varphi(q/d)} \sum_{\chi \pmod{q/d}} \tau(\overline{\chi}) \chi(a) \sum_{n} c_{dn} \chi(n).$$

If $q \geq 1$ is an integer and $b \in \mathbb{Z}$, then writing $b = (b, q)b'$, $q = (b, q)q'$, we obtain

(8.5) $$e\Big(\frac{b}{q}\Big) = \frac{1}{\varphi(q')} \sum_{\chi \pmod{q'}} \tau(\overline{\chi}) \chi(b').$$

This quickly gives the Lemma by writing $(n, q) = d$ and changing the order of summation.
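
The change of basis formula can be tested numerically in the simplest case of a prime modulus. The Python sketch below is an illustration only: it takes $q = 7$, builds the Dirichlet characters modulo 7 from the primitive root 3, uses the convention $\tau(\chi) = \sum_{m} \chi(m) e(m/q)$ for the Gauss sum, and compares the two sides of the formula for a hypothetical test sequence $c_n$. For prime $q$ the divisor sum has only the terms $d = 1$ (characters modulo $q$) and $d = q$ (the trivial character modulo 1, which picks out $c_{qn}$).

```python
import cmath
import math


def e(z):
    return cmath.exp(2j * math.pi * z)


def characters_mod_prime(p, g):
    """All Dirichlet characters mod a prime p, built from a primitive root g."""
    index = {}
    r = 1
    for j in range(p - 1):
        index[r] = j
        r = (r * g) % p
    assert len(index) == p - 1, "g must be a primitive root mod p"
    return [{m: e(k * index[m] / (p - 1)) for m in range(1, p)} for k in range(p - 1)]


def gauss_sum_conj(chi, p):
    # tau(chi-bar) = sum over m of conj(chi(m)) * e(m/p)
    return sum(chi[m].conjugate() * e(m / p) for m in range(1, p))


def additive_side(c, a, p):
    return sum(c_n * e(a * n / p) for n, c_n in c.items())


def character_side(c, a, p, g):
    total = 0
    # d = 1: characters modulo p; chi(n) vanishes when p divides n
    for chi in characters_mod_prime(p, g):
        inner = sum(c_n * chi[n % p] for n, c_n in c.items() if n % p != 0)
        total += gauss_sum_conj(chi, p) * chi[a % p] * inner
    total /= p - 1
    # d = p: trivial character modulo 1, contributing the terms c_{pn}
    total += sum(c_n for n, c_n in c.items() if n % p == 0)
    return total


p, g, a = 7, 3, 3
c = {n: complex(n % 5, 1.0 / n) for n in range(1, 41)}  # hypothetical test sequence
lhs = additive_side(c, a, p)
rhs = character_side(c, a, p, g)

print(lhs, rhs)
```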


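Proof of (8.1). Suppose $x_0 = a/q$, with $(a, q) = 1$. We will show

(8.6) $$I(\Delta, T, \tfrac{a}{q}, N) \ll_q N |\zeta(1+2iT)|^2 S_{\Delta} + NT^{-\delta},$$

where $\delta > 0$ and $S_{\Delta}$ has the properties stated in Proposition 4.1. This is sufficient for (8.1).
Using Lemma 8.2, we have

(8.7) $$I(\Delta, T, \tfrac{a}{q}, N) = \int_{-\infty}^{\infty} w_1\Big(\frac{t}{\Delta}\Big) \Big| \sum_{d | q} \sum_{\chi \pmod{q/d}} \frac{\tau(\overline{\chi}) \chi(a)}{\varphi(q/d)} \sum_{n \geq 1} \frac{\tau_{iT}(dn) \chi(n)}{(dn)^{1/2-it+iT}} w_2\Big(\frac{dn}{N}\Big) \Big|^2 dt.$$

The inner sum over $n$ has a generating function

(8.8) $$\sum_{n=1}^{\infty} \frac{\tau_{iT}(dn) \chi(n)}{(dn)^{s}} = L(s + iT, \chi) L(s - iT, \chi) Z_{d,\chi}(s),$$

where $Z_{d,\chi}(s)$ is a finite Euler product over prime powers dividing $d$, satisfying $Z_{d,\chi}(s) \ll d^{\sigma+\varepsilon}$,
uniformly for $\mathrm{Re}(s) = \sigma > .01$. By the Mellin transform method, we have

(8.9) $$I(\Delta, T, \tfrac{a}{q}, N) = \int_{-\infty}^{\infty} w_1\Big(\frac{t}{\Delta}\Big) \Big| \sum_{d | q} \sum_{\chi \pmod{q/d}} \frac{\tau(\overline{\chi}) \chi(a)}{\varphi(q/d)} \frac{1}{2\pi i} \int_{(1)} N^s \widetilde{w_2}(s) L(\tfrac12 + s - it + 2iT, \chi) L(\tfrac12 + s - it, \chi) Z_{d,\chi}(\tfrac12 - it + iT + s)\, ds \Big|^2 dt.$$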

We move the contour to the left, to the line $\mathrm{Re}(s) = 0$. When $\chi = \chi_0$ is the principal
character, then we cross poles at $s = 1/2 + it - 2iT$, and at $s = 1/2 + it$. The former pole is
very small since $\widetilde{w_2}$ is $O(T^{-100})$ at this point. The latter pole requires closer scrutiny, which
we defer for a moment. Applying Cauchy-Schwarz, we see that the new line of integration
gives

( 8 . 10 ) 

<C q £ 



|w 2 (s)| L(l/2 + s-it+2iT,x)L(l/2 + s-it,x) 


2 

dsdt. 


Using Cauchy-Schwarz again, and Proposition 8.1, we bound this part by

(8.11) $$q^{1+\varepsilon} T^{\varepsilon} (\Delta^{1/2} + T^{1/3}) \Delta^{1/2}.$$

Since $q$ is fixed, and $\Delta \ll T^{.94}$, this provides a power saving over $(\Delta T)^{1/2}$, so this is more
than sufficient towards the proof of (8.1).
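
The power saving can be made explicit by comparing exponents. Writing $\Delta = T^{\theta}$ and ignoring the factors $q^{1+\varepsilon} T^{\varepsilon}$, the bound (8.11) has exponent $\max(\theta, \frac13 + \frac{\theta}{2})$, while $(\Delta T)^{1/2}$ has exponent $\frac{1+\theta}{2}$. The small Python sketch below tabulates the difference over the range $.49 \leq \theta \leq .94$ used in (8.2); the grid of $\theta$ values is an arbitrary choice made for illustration.

```python
from fractions import Fraction


def saving(theta):
    # Exponent of (Delta*T)^{1/2} minus exponent of (Delta^{1/2} + T^{1/3}) * Delta^{1/2}
    main = (1 + theta) / 2
    bound = max(theta, Fraction(1, 3) + theta / 2)
    return main - bound


thetas = [Fraction(k, 100) for k in range(49, 95)]
min_saving = min(saving(th) for th in thetas)

print(min_saving)
```

The minimum is attained at the top of the range, $\theta = .94$, so the saving is at least $T^{.03}$ before the $T^{\varepsilon}$ loss.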
Now we turn to the pole from $s = 1/2 + it$ that occurs for $\chi = \chi_0$. Conceptually, it
should be clear that the main term could have been calculated using the known residues
of the Estermann function and should therefore agree with other main terms that we have
encountered in a variety of ways throughout this work. Nevertheless, it is easiest to carry
forward directly. We have that this pole contributes to $I(\Delta, T, \tfrac{a}{q}, N)$ an amount

(8.12) $$\ll_q N |\zeta(1+2iT)|^2 \int w_1\Big(\frac{t}{\Delta}\Big) |\widetilde{w_2}(1/2 - it)|^2 dt.$$

Here we need to use that $\sum_{\Delta} w_1(t/\Delta) \ll 1$, uniformly in $t \in \mathbb{R}$, where the sum is over dyadic
values of $\Delta$. Hence we may interpret this as an expression of the form $N |\zeta(1+2iT)|^2 S_{\Delta}$. □